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Modified Nucleotides 

The invention relates to modified nucleotides. In 
particular, this invention discloses nucleotides having a 
5 removable protecting group, their use in polynucleotide 

sequencing methods and a method for chemical deprotection 
of the protecting group. 

Advances in the study of molecules have been led, in 
part, by improvement in technologies used to characterise 
10 the molecules or their biological reactions. In 

particular, the study of the nucleic acids DNA and RNA 
has benefited from developing technologies used for 
sequence analysis and the study of hybridisation events. 
An example of the technologies that have improved 
15 the study of nucleic acids is the development of 

fabricated arrays of immobilised nucleic acids. These 
arrays consist typically of a high-density matrix of 
polynucleotides immobilised onto a solid support 
material. See, e.g., Fodor et al . , Trends Biotech. 
20 12:19-26, 1994, which describes ways of assembling the 

nucleic acids using a chemically sensitized glass surface 
protected by a mask, but exposed at defined areas to 
allow attachment of suitably modified nucleotide 
phosphoramidites . Fabricated arrays can also be 
25 manufactured by the technique of "spotting" known 

polynucleotides onto a solid support at predetermined 
positions (e.g., Stimpson et al . , Proc . Natl. Acad. Sci . 
USA 92:6379-6383, 1995). 

Sequencing by synthesis of DNA ideally requires the 
30 controlled (i.e. one at; a time) incorporation of the 
correct complementary nucleotide opposite the 
oligonucleotide being sequenced. This allows for 
accurate sequencing by adding nucleotides in multiple 
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cycles as each nucleotide residue is sequenced one at a 
time, thus preventing an uncontrolled series of 
incorporations occurring. The incorporated nucleotide is 
read using an appropriate label attached thereto before 
5 removal of the label moiety and the subsequent next round 
of sequencing. In order to ensure only a single 
incorporation occurs, a structural modification 
( M blocking group") of the sequencing nucleotides is 
required to ensure a single nucleotide incorporation but 

10 which then prevents any further nucleotide incorporation 
into the polynucleotide chain. The blocking group must 
then be removable, under reaction conditions which do not 
interfere with the integrity of the DNA being sequenced. 
The sequencing cycle can then continue with the 

15 incorporation of the next blocked, labelled nucleotide. 
In order to be of practical use, the entire process 
should consist of high yielding, highly specific chemical 
and enzymatic steps to facilitate multiple cycles of 
sequencing . 

20 To be useful in DNA sequencing, nucleotide, and more 

usually nucleotide triphosphates, generally require a 
3 'OH-blocking group so as to prevent the polymerase used 
to incorporate it into a polynucleotide chain from 
continuing to replicate once the base on the nucleotide 

15 is added. There are many limitations on the suitability 
of a molecule as a blocking group. It must be such that 
it prevents additional nucleotide molecules from being 
added to the polynucleotide chain whilst simultaneously 
being easily removable from the sugar moiety without 

0 causing damage to the polynucleotide chain. Furthermore, 
the modified nucleotide must be tolerated by the 
polymerase or other appropriate enzyme used to 
incorporate it into the polynucleotide chain. The ideal 
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blocking group- will therefore exhibit long term 
stability, be efficiently incorporated by the polymerase 
enzyme, cause total blocking of secondary or further 
incorporation and have the ability to be removed under 
5 mild conditions that do not cause damage to the 

polynucleotide structure, preferably under aqueous 
conditions. These stringent requirements are formidable 
obstacles to the design and synthesis of the requisite 
modified nucleotides, 
0 Reversible blocking groups for this purpose have 

been described previously but none of them generally meet 
the above criteria for polynucleotide, e.g. DNA- 
compat ible , chemistry . 

Metzker et al . , (Nucleic Acids Research, 22(20): 
5 4259-4267, 1994) discloses the synthesis and use of eight 
3' -modified 2 -deoxyribonucleoside 5 ' -triphosphates (3»- 
modified dNTPs) and testing in two DNA template . assays 
for incorporation activity. The 3 1 -modified dNTPs 
included 3' allyl deoxyriboadenosine 5 ' -triphosphate (3 • - 
) allyl dATP) . However, the 3'allyl blocked compound was 
not used to demonstrate a complete cycle of termination, 
deprotection and reinitiation of DNA synthesis: the only 
test results presented were those which showed the 
ability of this compound to terminate DNA synthesis in a 
single termination assay, out of eight such assays 
conducted, each conducted with a different DNA 
polymerase . 

WO02/29003 (The Trustees of Columbia University in 
the City of New York) describes a sequencing method which , 
may include the use of an allyl protecting group to cap 
the 3 * -OH group on a growing strand of DNA in a 
polymerase reaction. The allyl group is introduced 
according to the procedure of Metzker (infra) and is said 
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to be removed by using methodology reported by Kamal et 
al (Tet. Let, 40, 371-372, 1999). 

The Kamal deprotection methodology employs sodium 
iodide and chlorotrimethylsilane so as to generate in 
5 situ iodotrimethylsilane, in acetonitrile solvent, 

quenching with sodium thiosulfate. After extraction into 
ethyl acetate and drying (sodium sulfate) , then 
concentration under reduced pressure and column 
chromatography (ethyl acetate : hexane ; 2:3 as eluant) , 
10 free alcohols were obtained in 90-98% yield. 

In WO02/29003, the Kamal allyl deprotection is 
suggested as being directly applicable in DNA sequencing 
without modification, the Kamal conditions being mild and 
specific . 

15 While Metzker reports on the preparation of a 

3 'allyl -blocked nucleotide or nucleoside and WO02/29003 
suggests the use of the allyl functionality as a 3 1 -OH 
cap during sequencing, neither of these documents 
actually teaches the deprotection of 3'-allylated 

20 hydroxyl group in the context of a sequencing protocol . 

Whilst the use of an allyl group as a hydroxyl protecting 
group is well known - it is easy to introduce and is 
stable across the whole pH range and to elevated 
temperatures - there is to date, no concrete embodiment 

25 of the successful cleavage of a 3' -allyl group under DNA 
compatible conditions, i.e. conditions under which the 
integrity of the DNA is not wholly or partially 
destroyed. In other words, it has not been possible 
hitherto to conduct DNA sequencing using 3 , OH allyl - 

30 blocked nucleotides. 

The Kamal methodology is inappropriate to conduct in 
aqueous media since the TMS chloride will hydrolyse * 
preventing the in situ generation of TMS iodide. Attempts 
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• to carry out the Kamal deprotection (in acetonitrile) in 
sequencing have proven unsuccessful in our hands. 

The present invention is based on the surprising 
development of a number of reversible blocking groups and 
5 methods of deprotecting them under DNA compatible 

conditions. Some of these blocking groups are novel per 
se; others have been disclosed in the prior art but, as 
noted above, it has not proved possible to utilised these 
blocking groups in DNA sequencing. 

10 One feature of the invention derives from the 

development of a completely new method of allyl 
deprotection. Our procedure is of broad applicability to 
the deprotection of virtually all allyl -protected 
hydroxyl functionality and may be effected in aqueous 

15 solution, in contrast to the methodology of Kamal et al . 
(which is effected in acetonitrile) and to the other 
methods known generally in the prior art which are highly 
oxygen-and moisture-sensitive. A further feature of the 
invention derives from the development of a new class of 

20 protecting groups. These are based upon acetals and 

related protecting groups but do not suffer from some of 
the disadvantages of acetal deprotection known in the 
prior art . 

The allyl deprotection methodology makes use of a 
25 water-soluble transition metal catalyst formed from a 
transition metal and at least partially water-soluble 
ligands. In aqueous solution these form at least 
partially water-soluble transition metal complexes. By 
aqueous solution herein is meant a liquid comprising at 
30 least 20 vol*, preferably at least 50%, for example at 
least 75 vol%, particularly at least 95 vol% and 
especially greater than above 98 vol%, ideally 100 vbl% 
of water as the continuous phase. 



WO 2004/018497 



PCT/GB2003/003686 



As those skilled in the art will appreciate, the 
allyl group may be used to protect not only the hydroxy 1 
group but also thiol and amine functionalities. Moreover 
allylic esters may be formed from the reaction between 
5 carboxylic acids and allyl halides, for example. Primary 
or secondary amides may also be protected using methods 
known in the art . The novel deprotection methodology 
described herein may be used in the deprotection of all 
these allylated compounds, e.g. allyl esters and mono- or 
10 bisallylated primary amines or allylated amides, or in 
the deprotection of allylated secondary amines. The 
method is also suitable in the deprotection of allyl 
esters and thioethers . 

Protecting groups which comprise the acetal 
15 functionality have been used previously as blocking 
groups. However, removal of such groups and ethers 
requires strongly acidic deprotections detrimental to DNA 
molecules. The hydrolysis of an acetal however, results 
in the formation of an unstable hemiacetal intermediate 
20 which hydrolyses under aqueous conditions to the natural 
hydroxyl group. The inventors have utilised this concept 
and applied it further such that this feature of the 
invention resides in utilising blocking groups that 
include protecting groups to protect intermediate 
25 molecules that would normally hydrolyse under aqueous 
conditions. These protecting groups comprise a second 
functional group that stabilises the structure of the 
intermediate but which can be removed at a later stage 
following incorporation into the polynucleotide. 
30 Protecting groups have, been used in organic synthesis 
reactions to temporarily mask the characteristic 
chemistry of a functional group because it interferes 
with another reaction. 
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Therefore, according to a first aspect of the 
invention there is provided a modified nucleotide or 
nucleoside molecule comprising a purine or pyrimidine 
base and a ribose or deoxyribose sugar moiety having a 
5 removable 3 1 -OH blocking group covalently attached 

thereto, such that the 3" carbon atom has attached a 
group of the structure 

-O-Z 

wherein Z is any of -C (R' ) 2 -0-R" , -C (R' ) 2 -N (R" ) 2 , - 
10 C(R') 2 ~N(H)R", -C(R') 2 -S-R" and -C(R') 2 -F, 

wherein each R" is or is part of a removable 
protecting group; 

each R' is independently a hydrogen atom, an alkyl, 
substituted alkyl, arylalkyl, alkenyl, alkynyl, aryl, 
heteroaryl, heterocyclic, acyl , cyano, alkoxy, aryloxy, 
heteroaryloxy or ami do group, or a detectable label 
attached through a linking group; or (R' ) 2 represents an 
alkylidene group of formula =C(R''') 2 wherein each R' ' ' 
may be the same or different and is selected from the 
group comprising hydrogen and halogen atoms and alkyl 
groups ; and 

wherein said molecule may be reacted to yield an 
intermediate in which each R" is exchanged for H or, 
where Z is -C(R') 2 -F, the F is exchanged for OH, SH or 
25 NH 2 , preferably OH, which intermediate dissociates under 
aqueous conditions to afford a molecule with a free 3'OH; 

with the proviso that where Z is -C (R' ) 2 -S-R" , both 
R' groups are not H. 

Viewed from another aspect, the invention provides a 
30 3'-0-allyl nucleotide or nucleoside which nucleotide or 
nucleoside comprises a detectable label 
linked to the base of the nucleoside or nucleotide, 
preferably by a cleavable linker. 



20 
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In a further aspect, the invention provides a 
polynucleotide comprising a 3 ' -O-allyl nucleotide or 
nucleoside which nucleotide or nucleoside comprises a 
detectable label linked to the base of the nucleoside or 
5 nucleotide, preferably by a cleavable linker. 

Viewed from a still further aspect, the invention 
provides a method of converting a compound of formula R- 
O-allyl, R 2 N(allyl), RNH(allyl), RN(allyl) 2 or R-S-allyl 
to a corresponding compound in which the allyl group is 
10 removed and replaced by hydrogen, said method comprising 
the steps of reacting a compound of formula R-O-allyl, 
R 2 N(allyl), RNH (allyl ) , RN(allyl) 2 or R-S-allyl in 
aqueous solution with a transition metal comprising a 
transition metal and one or more ligands selected from 
15 the group comprising water-soluble phosphine and water- 
soluble nitrogen-containing phosphine ligands, wherein 
the or each R is a water-soluble biological molecule. 

In a further aspect the invention provides a method 
of controlling the incorporation of a nucleotide molecule 
20 complementary to the nucleotide in a target single- 
stranded polynucleotide in a synthesis or sequencing 
reaction comprising incorporating into the growing 
complementary polynucleotide a molecule according to the 
invention, the incorporation of said molecule preventing 
25 or blocking introduction of subsequent nucleoside or 
nucleotide molecules into 

said growing complementary polynucleotide. 

In a further aspect, the invention provides a method 
for determining the sequence of a target single -stranded 
30 polynucleotide, comprising monitoring the sequential 

incorporation of complementary nucleotides, wherein at 
least one incorporation, and preferably all of the 
incorporations is of a nucleotide according to the 
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invention as hereinbefore described which preferably 
comprises a detectable label linked to the base of the 
nucleoside or nucleotide by a cleavable linker and 
wherein the identity of the nucleotide incorporated is 
5 determined by detecting the label, said blocking group 

and said label being removed prior to introduction of the 
next complementary nucleotide. 

From a further aspect, the invention provides a 
method for determining the sequence of a target single - 
10 stranded polynucleotide, comprising: 

(a) providing a plurality of different nucleotides 
according to the hereinbefore described invention which 
nucleotides are preferably linked from the base to a 
detectable label by a cleavable linker and wherein the 

15 detectable label linked to each type of nucleotide can be 
distinguished upon detection from the detectable label 
used for other types of nucleotides; 

(b) incorporating the nucleotide into the 
complement of the target single-stranded polynucleotide; 

20 (c) detecting the label of the nucleotide of (b) , 

thereby determining the type of nucleotide incorporated; 

(d) removing the label of the nucleotide of (b) and 
the blocking group; and 

(e) optionally repeating steps (b) - (d) one or more 
25 times; 

thereby determining the sequence of a target single- 
stranded polynucleotide. 

Additionally, in another aspect, the invention 
provides a kit, comprising: 
30 (a) a plurality of different individual nucleotides 

of the invention; and 

(b) packaging materials therefor. 

The nucleosides or nucleotides according to or used 
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in the methods of the present invention comprise a purine 
or pyrimidine base and a ribose or deoxyribose sugar 
moiety which has a blocking group covalently attached 
thereto, preferably at the 3 f O position, which renders 
5 the molecules useful in techniques requiring blocking of 
the 3 1 -OH group to prevent incorporation of additional 
nucleotides, such as for example in sequencing reactions, 
polynucleotide synthesis, nucleic acid amplification, 
nucleic acid hybridisation assays, single nucleotide 
10 polymorphism studies, and other such techniques. 

Where the term "blocking group" is used herein in 
the context of the invention, this embraces both the 
allyl and W Z" blocking groups described herein. However, 
it will be appreciated that, in the methods of the 
15 invention as described and claimed herein, where mixtures 
of nucleotides are used, these very preferably each 
comprise the same type of blocking, i.e. allyl -blocked or 
"Z" -blocked. Where W Z" -blocked nucleotides are used, 
each "Z" group will generally be the same group, except 
20 in those cases where the detectable label forms part of 
the "Z" group, i.e. is not attached to the base. 

Once the blocking group has been removed, it is 
possible to incorporate another nucleotide to the free 
3 1 -OH group . 

25 The molecule can be linked via the base to a 

detectable label by a desirable linker, which label may 
be a fluorophore, for example. The detectable label may 
instead, if desirable, be incorporated into the blocking 
groups of formula "Z" . The linker can be acid labile, 

30 photolabile or contain.a disulfide linkage. Other 
linkages, in particular phosphine-cleavable azide- 
containing linkers, may be employed in the invention as 
described in greater detail. 
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Preferred labels and linkages included those 
disclosed in WO 03/048387. 

In the methods where nucleotides are incorporated, 
e.g. where the incorporation of a nucleotide molecule 
complementary to the nucleotide in a target single 
stranded polynucleotide is controlled in a synthesis or 
sequencing reaction of the invention, the incorporation 
of the molecule may be accomplished via a terminal 
transferase, a polymerase or a reverse transcriptase. 

Preferably, the molecule is incorporated by a 
polymerase and particularly from Thermococcus sp. , such 
as 9°N. Even more preferably, the polymerase is a mutant 
9°N A4 8 5L and even more preferably is a double mutant 
Y409V and A485L. 

In the methods for determining the sequence of a 
target single-stranded polynucleotide comprising 
monitoring the sequential incorporation of complementary 
nucleotides of the invention, it is preferred that the 
blocking group and the label may be removed in a single 
chemical treatment step. Thus, in a preferred embodiment 
of the invention, the blocking group is cleaved 
simultaneously with the label. This will of course be a 
feature inherent to those blocking groups of formula Z 
which incorporate a detectable label. 

Furthermore, preferably the blocked and labelled 
modified nucleotide constructs of the nucleotide bases A, 
T, C and G are recognised as substrates by the same 
polymerase enzyme. 

In the methods described herein, each of the 
nucleotides can be brought into contact with the target 
sequentially, with removal of non- incorporated 
nucleotides prior to addition of the next nucleotide, 
where detection and removal of the label and the blocking 
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group is carried out either after addition of each 
nucleotide, or after addition of all four nucleotides. 

In the methods, all of the nucleotides can be 
brought into contact with the target simultaneously, 
5 i.e., a composition comprising all of the different 

nucleotides is brought into contact with the target, and 
non- incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and the 
blocking group. 

10 The methods can comprise a first step and a second 

step, where in the first step, a first composition 
comprising two of the four types of modified nucleotides 
is brought into contact with the target, and non- 
incorporated nucleotides are removed prior to detection 

15 and subsequent to removal of the label and the blocking 

group, and where in the second step, a second composition 
comprising the two nucleotides not included in the first 
composition is brought into contact with the target, and 
non- incorporated nucleotides are removed prior to 

20 detection and subsequent to removal of the label and 

blocking group, and where the first steps and the second 
step can be optionally repeated one or more times. 

The methods described herein can also comprise a 
first step and a second step, where in the first step, a 

25 composition comprising one of the four nucleotides is 
brought into contact with the target, and non- 
incorporated nucleotides are removed prior to detection 
and subsequent to removal of the label and blocking 
group, and where in the second step, a second composition, 

30 comprising the three nucleotides not included in the 
first composition is brought into contact with the 
target, and non- incorporated nucleotides are removed 
prior to detection and subsequent to removal of the label 
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and blocking group, and where the first steps and the 
second step can be optionally repeated one or more times. 

The methods described herein can also comprise a 
first step and a second step, where in the first step, a 
5 first composition comprising three of the four 

nucleotides is brought into contact with the target, and 
non- incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and 
blocking group and where in the second step, a 
10 composition comprising the nucleotide not included in the 
first composition is brought into contact with the 
target, and non- incorporated nucleotides are removed 
prior to detection and subsequent to removal of the label 
and blocking group, and where the first steps and the 
15 second step can be optionally repeated one or more times. 
The incorporating step in the methods of the 
invention can be accomplished via a terminal transferase, 
a polymerase or a reverse transcriptase as hereinbefore 
defined. The detectable label and/or the cleavable 
20 linker can be of a size sufficient to prevent the 

incorporation of a second nucleotide or nucleoside into 
the nucleic acid molecule. 

In certain methods described herein for determining 
the sequence of a target single-stranded polynucleotide, 
25 each of the four nucleotides, one of which will be 

complementary to the first unpaired base in the target 
polynucleotide, can be brought into contact with the 
target sequentially, optionally with removal of non- 
incorporated nucleotides prior to addition of the next 
30 nucleotide. Determination of the success of the 

incorporation may be carried out either after provision 
of each nucleotide, or after the addition of all of the 
nucleotides added. If it is determined after addition of 
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fewer than four nucleotides that one has been 
incorporated, it is not necessary to provide further 
nucleotides in order to detect the nucleotides 
complementary to the incorporated nucleotide. 
5 Alternatively, all of the nucleotides can be brought 

into contact with the target simultaneously, i.e., a 
composition comprising all of the different nucleotide 
(i.e. A, T, C and G or A, U, C and G) is brought into 
contact with the target, and non- incorporated nucleotides 
10 removed prior to detection and removal of the label (s) . 

The methods involving sequential addition of nucleotides 
may comprise a first substep and optionally one or more 
subsequent substeps. In the first substep a composition 
comprising one, two or three of the four possible 
15 nucleotides is provided, i.e. brought into contact with, 
the target. Thereafter any unincorporated nucleotides 
may be removed and a detecting step may be conducted to 
determine whether one of the nucleotides has been 
incorporated. If one has been incorporated, the cleavage 
20 of the linker may be effected. In this way the identity 
of a nucleotide in the target polynucleotide may be 
determined. The nascent polynucleotide may then be 
extended to determine the identity of the next unpaired 
nucleotide in the target oligonucleotide. 
25 If the first substep above does not lead to 

incorporation of a nucleotide, or if this is not known, 
since the presence of incorporated nucleotides is not 
sought immediately after the first substep, one or more 
subsequent substeps may be conducted in which some or all % 
30 of those nucleotides not provided in the first substep 
are provided either, as appropriate, simultaneously or 
subsequently. Thereafter any unincorporated nucleotides 
may be removed and a detecting step conducted to 
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determine whether one of the classes of nucleotide has 
been incorporated. If one has been incorporated, 
cleavage of the linker may be effected. In this way the 
identity of a nucleotide in the target polynucleotide may 
5 be determined. The nascent polynucleotide may then be 
extended to determine the identity of the next unpaired 
nucleotide in the target oligonucleotide. If necessary, 
a third and optionally a fourth substep may be effected 
in a similar manner to the second substep. Obviously, 
10 once four substeps have been effected, all four possible 
nucleotides will have been provided and one will have 
been incorporated. 

It is desirable to determine whether a type or class 
of nucleotide has been incorporated after any particular 
15 combination comprising one, two or three nucleotides has 
been provided. In this way the unnecessary cost and time 
expended in providing the other nucleotide (s) is 
obviated. This is not a required feature of the 
invention, however . 
20 It is also desirable, where the method for 

sequencing comprises one or more substeps, to remove any 
unincorporated nucleotides before further nucleotide are 
provided. Again, this is not a required feature of the 
invention. Obviously, it is necessary that at least some 
25 and preferably as many as practicable of the 

unincorporated nucleotides are removed prior to the 
detection of the incorporated nucleotide. 

The kits of the invention include: (a) individual 
nucleotides according to the hereinbefore described 
30 invention, where each nucleotide has a base that is 

linked to a detectable label via a cleavable linker, or a 
detectable label linked via an optionally cleavable liner 
to a blocking group of formula Z, and where the 
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detectable label linked to each nucleotide can be 
distinguished upon detection from the detectable label 
used for other three nucleotides; and (b) packaging 
materials therefor. The kit can further include an 
5 enzyme for incorporating the nucleotide into the 

complementary nucleotide chain and buffers appropriate 
for the action of the enzyme in addition to appropriate 
chemicals for removal of the blocking group and the 
detectable label, which can preferably be removed by the 
10 same chemical treatment step. 

The nucleotides/nucleosides are suitable for use in 
many different DNA-based methodologies, including DNA 
synthesis and DNA sequencing protocols. 

The invention may be understood with reference to 
15 the attached drawings in which: 

Fig. 1 shows exemplary nucleotide structures useful 
in the invention. For each structure, X can be H, 
phosphate, diphosphate or triphosphate. R x and R 2 can be 
the same or different, and can be selected from H, OH, or 
20 any group which can be transformed into an OH, including, 
but not limited to, a carbonyl . Some suitable functional 
groups for Ri and R 2 include the structures shown in Fig. 
3 and Fig. 4 . 

Fig. 2 shows structures of linkers useful in certain 
25 aspects of the invention , including (1) disulfide 
linkers and acid labile linkers, (2) dialkoxybenzyl 
linkers, (3) Sieber linkers, (4) indole linkers and (5) 
t-butyl Sieber linkers. 

Fig. 3 shows some functional molecules usef ul , in the 
30 invention, including some cleavable linkers and some 

suitable hydroxyl protecting groups. In these structures, 
Ri and R 2 may be the same of different, and can be H, OH, 
or any group which can be transformed into an OH group, 
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including a carbonyl . R 3 represents one or more 
substituents independently selected from alkyl, alkoxyl, 
amino or halogen groups. R 4 and R 5 can be H or alkyl, 
and R 6 can be alkyl, cycloalkyl, alkenyl, cycloalkenyl or 
5 benzyl. X can be H, phosphate, diphosphate or 
triphosphate . 

Fig. 4 is a schematic illustration of some of the Z 
blocking groups that can be used according to the 
invention. 

10 Fig. 5 shows two cycles of incorporation of labelled 

and blocked DGTP, DCTP and dATP respectively (compounds 
18, 24 and 32) . 

Fig, 6 shows six cycles of incorporation of labelled 
and blocked DTTP (compound 6) . 
15 Fig. 7 shows the effective blocking by compound 38 (a 

3'-0allyl nucleotide of the invention). 

The present invention relates to nucleotide or 
nucleoside molecules that are modified by the reversible 
covalent attachment of a 3 1 -OH blocking groups thereto, 
20 and which molecules may be used in reactions where 

blocked nucleotide or nucleoside molecules are required, 
such as in sequencing reactions, polynucleotide synthesis 
and the like. 

Where the blocking group is an allyl group, it may 
25 be introduced into the 3 • -position using standard 
literature procedures such as that used by Metzker 
(infra) . 

The allyl groups are removed by reacting in aqueous 
solution a compound of formula R-O-allyl, R 2 N(allyl) , 
30 RNH (allyl ) , RN(allyl) 2 or R-S-allyl (wherein R is a 

water-soluble biological molecule) with a transition 
metal, wherein said transition metal is capable of 
forming a metal allyl complex, in the presence of one or 
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more ligands selected from the group comprising water- 
soluble phosphine and water-soluble mixed nitrogen- 
phosphine 1 igands . 

The water-soluble biological molecule is not 
5 particularly restricted provided, of course, it contains 
one or more hydroxyl, acid, amino, amide or thiol 
functionalities protected with an allyl group. Allyl 
esters are examples of compounds of formula R-O-allyl. 
Preferred functionalities are hydroxyl and amino. 
10 As used herein the term biological molecule is used 

to embrace any molecules or class of molecule which 
performs a biological role. Such molecules include for 
example, polynucleotides such as DNA and RNA, 
oligonucleotides and single nucleotides. In addition, 
15 peptides and peptide mimetics, such as enzymes and 

hormones etc., are embraced by the invention. Compounds 
which comprise a secondary amide linkage/ such as 
peptides, or a secondary amine, where such compounds are 
allylated on the nitrogen atom of the secondary amine or 
.20 amide, are examples of compounds of formula R 2 N(allyl) in 
which both R groups belong to the same biological 
molecule. Particularly preferred compounds however are 
polynucleotides, (including oligonucleotides) and 
nucleotides and nucleosides, preferably those which 
25 contain one base to which is attached a detectable label 
linked through a cleavable linker. Such compounds are 
useful in the determination of sequences of 
oligonucleotides as described herein. 

Transition metals of use in the invention are any 
30 which may form metal allyl complexes, for example 

platinum, palladium, rhodium, ruthenium, osmium and 
iridium. Palladium is preferred. 

The transition metal, e.g. palladium, is 
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• conveniently introduced as a salt, e.g. as a halide . 
Mixed salts such as Na 2 PdCl 4 may also be used. Other 
appropriate salts and compounds will be readily 
determined by the skilled person and are commercially 
5 available, e.g. f rom Aldrich Chemical Company. 

Suitable ligands are any phosphine or mixed 
nitrogen-phosphine ligands known to those skilled in the 
art, characterised in that the ligands are derivatised so 
as to render them water-soluble, e.g. by introducing one 
10 or more sulfonate, amine, hydroxyl (preferably a 

plurality of hydroxyl) or carboxylate' residues. Where 
amine residues are present, formation of amine salts may 
assist the solublisation of the ligand and thus the 
metal -allyl complex. Examples of appropriate ligands are 

15 triaryl phosphines, e.g. triphenyl phosphine, derivatised 
so as to make them water-soluble. Also preferred are 
trialkyl phosphines, e.g. tri-Ci-6-alkyl phosphines such 
as triethyl phosphines; such trialkyl phosphines are 
likewise derivatised so as to make them water-soluble. 

20 Sulfonate -containing and carboxylate-containing 

phosphines are particularly preferred; an example of the 
former 3 , 3 1 , 3 " -phosphinidyne tris (benzenesulf onic acid) 
which is commercially available from Aldrich Chemical 
Company as the trisodium salt; and a preferred example of 

25 the latter is tris (2- carboxyethyl) phosphine which is 
available from Aldrich as the hydrochloride salt. 

The derivatised water-soluble phosphines and 
nitrogen- containing phosphines described herein may be 
used as their salts (e.g. as the hydrochloride or sodium 

30 salts) or, for example, in the case of the sulfonic and 
carboxylic acid-containing phosphines described herein, 
as the free acids. Thus 3 , 3 r , 3 11 -phosphinidyne tris 
(benzenesulf onic acid) and tris (2 -carboxyethyl ) phosphines 
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may be introduced either as the triacids or the trisodium 
salts. Other appropriate salts will be evident to those 
skilled in the art. The existence in salt form is not 
particularly important provided the phosphines are 
soluble in aqueous solution. 

Other ligands which may be used to include the 
following : 



P ^V H 1: 



as the hydrochloride salt,neutral 
compound and the trisodium salt 




OH 



3 



as the hydrochloride salt, neutral 
compound and the trisodium salt 



Na0 3 




,S0 3 Na 




P NMe 2 



NMe 3 



SOjNa 
R!=OMe,CO2H,C02Na 



J 2 



Na0 3 S 




SOjNa 



PPh 



(CHj)n 
Ph 2 P \ 
n=l-5 

R=P03Na2, 
C02Na, 
S03Na, 
OH 



PPh, 



N-|__^— N 



Ph 2 P 




/ - i— / - SOjNa 
^-"^S RI=OMe.C02H,C02Na 
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The skilled person will be aware that the atoms 
chelated to the transition metal in the water soluble 
10 complex may be part of mono- or polydentate ligands. 

Some such polydentate ligands are shown above. Whilst 
monodentate ligands are preferred, the invention thus 
also embraces methods which use water-soluble bi-, tri- , 
tetra-, penta- and hexadentate water-soluble phosphine 
15 and water-soluble nitrogen-containing phosphine ligands 
The various aspects of the invention relating to 
allyl blocking groups are of particular utility in 
sequencing polynucleotides wherein the 3 ' -OH is 
allylated. However, when present, the 2 ■ -OH is equally 
20 amenable to allylation, and to deprotection according to 
the method of the invention if necessary. In fact any 
allylated alcohol may be deprotected according to the 
method of the invention. Preferred allylated alcohols, 
however, are those derived from primary and secondary 
25 alcohols. Particularly preferred are allylated 

nucleosides and nucleotides as described herein. It is 
possible to deprotect tertiary allylated alcohols - the 
reaction is simply slower (although deprotection may be 
in such, and other deprotect ions of this invention, 
30 accelerated if necessary by heating the solution, e.g. to 
4 0 °C, preferably 50 °C or higher such as approximately 
60 °C or even up to 8 0 °C) . 

It is also possible to deprotect allylated primary 
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or secondary amines and allylated thiols. 

As noted earlier, the aqueous solution in which 
allyl deprotection is effected need not be 100% (as the 
continuous phase) . However, substantially pure water 
5 (e.g. at least 98 vol% preferably about 100 vol%) is 
preferred. Cosolvents are generally not required 
although they can assist in the solublisation of the 
allylated substrate for the deallylation . Generally, 
biomolecules are readily soluble in water (e.g. pure 

10 water) in which the deprotection reaction described 

herein may be effected. If desirable, one or more water- 
miscible cosolvents may be employed. Appropriate 
solvents include acetonitrile or dimethyl sulfoxide, 
methanol, ethanol and acetone, methanol being preferred. 

15 Less preferred solvents include tetrahydrof uran (THF) and 
dioxane . 

In the method of allyl deprotection according to the 
invention, a soluble metal complex is formed comprising a 
transition metal and one or more water-soluble phosphine 

20 and water-soluble nitrogen-containing phosphine ligands. 
More than one type of water-soluble phosphine/nitrogen- 
containing phosphine ligand may be used in a deallylation 
reaction although generally only one type of these 
classes of ligand will be used in a given reaction. We 

25 believe the deallylation reaction to be catalytic. 

Accordingly, the quantity of transition metal, e.g. 
palladium, may be less than 1 mol% (calculated relative 
to the allyl -protected compound to be deprotected) . 
Advantageously the amount of catalyst may be much less 

30 than 1 mol%, e.g. <0.50 mol%, preferably <0.10 mol%, 

particularly <0.05mol%. Even lower quantities of metal 
may be used, for example <0.03 or even <0.01 mol%. As 
those skilled in the art will be aware, however, as 
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quantity of catalyst is reduced, so too is the speed of 
the reaction. The skilled person will be able to judge, 
in any instance, the precise quantity of transition metal 
and thus catalyst most optimally suited to any particular 
5 deallylation reaction. 

In contrast to the amount of metal required in 
forming the active catalyst, the quantity of water- 
soluble phosphorus-containing ligand(s) used must be 
greater than 1 molar equivalent (again calculated 

10 relative to the allyl -protected compound to be 

deprotected) . Preferably greater than 4, e.g. greater 
than 6, for example 8-12 molar equivalents of ligand may 
be used. Even higher quantities of ligand e.g. >20 mole 
equivalents may be used if desired. 

15 The skilled person will be able to determine the 

quantity of ligand best suited to any individual 
reaction. 

Where the blocking group is any of -C (R' ) 2 -0-R" , - 
C(R' ) 2 -N(R") 2 , -C(R' ) 2 -N(H)R", -C(R') 2 -S-R" and -C(R') 2 -F, 
20 i.e. of formula Z, each R' may be independently H or an 
alkyl 

The intermediates produced advantageously 
spontaneously dissociate under aqueous conditions back 
to the natural 3* hydroxy structure, which permits 

25 further incorporation of another nucleotide. Any 

appropriate protecting group may be used, as discussed 
herein. Preferably, Z is of formula -C (R' ) 2 -0-R" , - 
C(R') 2 -N(R") 2 , -C(R' ) 2 -N(H)R" and -C(R') 2 -SR". 
Particularly preferably, Z is of the formula -C(R') 2 - 

30 0-R", -C(R' ) 2 -N(R") 2 , and -C(R') 2 -SR". R" may be a 
benzyl group or a substituted benzyl group. 

One example of groups of structure -O-Z wherein Z 
is -C(R f ) 2 -N(R") 2 are those in which -N(R") 2 is azido 
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(-N 3 ) . One preferred such example is azidomethyl 
wherein each R' is H. Alternatively, R' in Z groups 
of formula -C(R') 2 -N 3 and other Z groups may be any of 
the other groups discussed herein. 

Examples of typical R' groups include Ci_ 6 alkyl, 
particularly methyl and ethyl, and the following (in 
which each structure shows the bond which connects the 
R' moiety to the carbon atom to which it is attached 
in the Z groups; the asterisks (*) indicate the points 
of attachment) : 



r*r* r* r* r* r* r* 

OH OMe OEt OPh CF 3 CH 2 (F) CH 2 (CI) 

r* • r* r' r • r* r* 

CH(CI) 2 CCI 3 CH(F) 2 CN CN CF 3 C(=0)R OHet 



15 




(wherein each R is an optionally substituted Ci-i 0 
alkyl group, an optionally substituted alkoxy group, a 
halogen atom or functional group such as hydroxyl, 
20 amino, cyano, nitro, carboxyl and the like) and "Het" 
is a heterocyclic (which may for example be a 
heteroaryl group) . These R' groups shown above are 
preferred where the other R' group is the same as the 
first or is hydrogen. Preferred Z groups are of 
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formula C(R') 2 N 3 in which the R' groups are selected 
from the structures given above and hydrogen; or in 
which (R')2 represents an alkylidene group of formula 
=C(R"') 2 , e.g. =C(Me) 2 . 
5 Where molecules contain Z groups of formula 

C(R')2N 3 , the azido group may be converted to amino by 
contacting such molecules with the phosphine or 
nitrogen-containing phosphines ligands described in 
detail in connection with the transition metal 

10 complexes which serve to cleave the allyl groups from 
compounds of formula PN-O-allyl, formula R-O-allyl, 
R 2 N (allyl ) , RNH (allyl) , RN(allyl) 2 and R-S-allyl. 
When transforming azido to amino, however, no 
transition metal is necessary. Alternatively, the 

15 azido group in Z groups of formula C(R') 2 N 3 may be 

converted to amino by contacting such molecules with 
the thiols, in particular water-soluble thiols such as 
dithiothreitol (DTT) . 

Where an R' group represents a detectable label 

20 attached through a linking group, the other R' group 
or any other part of *Z* will generally not contain a 
detectable label, nor will the base of the nucleoside 
or nucleotide contain a detectable label. Appropriate 
linking groups for connecting the detectable label to 

25 the 3 'blocking group will be known to the skilled 

person and examples of such groups are described in 
greater detail hereinafter. 

Exemplary of linkages in R' groups containing 
detectable labels are those which contain one or more 

30 amide bonds. Such linkers may also contain an 

arylene, e.g. phenylene, group in the chain (i.e. a 
linking moiety -Ar- where the phenyl ring is part of 
the linker by way of its 1,4-disposed carbon atoms). 
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The phenyl ring may be substituted at its non-bonded 
position with one or more substituents such as alkyl, 
hydroxyl, alkyloxy, halide, nitro, carboxyl or cyano 
and the like, particularly electron-withdrawing 
5 groups, which electron-withdrawing is either by 

induction or resonance. The linkage in the R' group 
may also include moieties such a -O-, -S(0) q , wherein 
q is 0, 1 or 2 or NH or Nalkyl. Examples of such Z 
groups are as follows: 




(wherein EWG stands for electron- withdrawing group; n 
is an integer of from 1 to 50, preferably 2-20, e.g. 3 
to 10; and fluor indicates a fluorophore) . An example 
30 of an electron-withdrawing group by resonance is 

nitro; a group which acts through induction is fluoro. 
The skilled person will be aware of other appropriate 
electron-withdrawing groups. In addition, it will be 
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understood that whilst a fluorophore is indicated as 
being the detectable label present, other detectable 
groups as discussed in greater detail hereinafter may 
be included instead. 
5 Where a detectable label is attached to a 

nucleotide at the 3 1 -blocking position, the linker 
need not be cleavable to have utility in those 
reactions, such as DNA sequencing, described herein 
which require the label to be "read" and removed 
10 before the next step of the reaction. This is because 
the label, when attached to the 3 'block, will become 
separated from the nucleotide when the intermediate 
compounds described herein collapse so as to replace 
the "Z" group with a hydrogen atom. As noted above, 
15 each R" is or is part of a removable protecting group. 
R" may be a benzyl group or is substituted benzyl 
group is an alternative embodiment. 

It will be appreciated that where it is possible 
to incorporate a detectable label onto a group R" , the 
20 invention embraces this possibility. Thus, where R" 
is a benzyl group, the phenyl ring may bear a linker 
group to which is attached a fluorophore or other 
detectable group. Introduction of such groups does 
not prevent the ability to remove such R"s and they do 
25 not prevent the generation of the desired unstable 

intermediates during deprotection of blocking groups 
of formula Z. 

As is known in the art, a "nucleotide" consists 
of a nitrogenous base, a sugar, and one or more 
30 phosphate groups. They are monomeric units of a 

nucleic acid sequence. In RNA, the sugar is a ribose, 
and in DNA a deoxyribose, i.e. a sugar lacking a 
hydroxyl group that is present in ribose. The 
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nitrogenous base is a derivative of purine or 
pyrimidine. The purines are adenine (A) and guanine 
(G) , and the pyrimidines are cytosine (C) and thymine 
(T) (or in the context of RNA, uracil (U) ) . The C-l 
5 atom of deoxyribose is bonded to N-l of a pyrimidine 
or N-9 of a purine. A nucleotide is also a phosphate 
ester or a nucleoside, with esterif icat ion occurring 
on the hydroxyl group attached to C-5 of the sugar. 
Nucleotides are usually mono, di- or triphosphates. 

10 A "nucleoside" is structurally similar to a 

nucleotide, but is missing the phosptiate moieties. An 
example of a nucleoside analogue would be one in which 
the label is linked to the. base and there is no 
phosphate group attached to the sugar molecule. 

15 Although the base is usually referred to as a 

purine or pyrimidine, the skilled person will 
appreciate that derivatives and analogues are 
available which do not alter the capability of the 
nucleotide or nucleoside to undergo Watson-Crick base 

20 pairing. "Derivative" or "analogue" means a compound 
or molecule whose core structure is the same as, or 
closely resembles that of, a parent compound, but 
which has a chemical or physical modification, such as 
a different or additional side group, or 2' and or 3' 

25 blocking groups, which allows the derivative 

nucleotide or nucleoside to be linked to another 
molecule. For example, the base can be a deazapurine. 
The derivatives should be capable of undergoing 
Watson-Crick pairing. "Derivative" and "analogue" 

30 also mean a synthetic nucleotide or nucleoside 
derivative having modified base moieties and/or 
modified sugar moieties. Such derivatives and analogs 
are discussed in, e.g., Scheit, Nucleotide Analogs 
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(John Wiley & Son, 1980) and Uhlman et al . , Chemical 
Reviews 90:543-584, 1990. Nucleotide analogs can also 
comprise modified phosphodiester linkages, including 
phosphorothioate , phosphorodithioate , alkyl - 
5 phosphonate, phosphoranilidate and phosphoramidate 

linkages. The analogs should be capable of undergoing 
Watson-Crick base pairing. "Derivative" , "analog" and 
"modified" as used herein, may be used 
interchangeably, and are encompassed by the terms 
10 "nucleotide" and "nucleoside" defined herein. 

In the context of the present invention, the term 
"incorporating" means becoming part of a nucleic acid 
(eg DNA) molecule or oligonucleotide or primer. An 
oligonucleotide refers to a synthetic or natural 
15 molecule comprising a covalently linked sequence of 
nucleotides which are formed by a phosphodiester or 
modified phosphodiester bond between the 3 1 position 
of the pentose on one nucleotide and the 5' position 
of the pentose on an adjacent nucleotide. 
20 The term "alkyl" covers straight chain, branched 

chain and cycloalkyl groups. Unless the context 
indicates otherwise, the term "alkyl" refers to groups 
having 1 to 10 carbon atoms, for example 1 to 8 carbon 
atoms, and typically from 1 to 6 carbon atoms, for 
25 example from 1 to 4 carbon atoms. Examples of alkyl 
groups include methyl, ethyl, propyl, isopropyl, n- 
butyl, isobutyl, tert -butyl, n-pentyl, 2-pentyl, 3- 
pentyl, 2 -methyl butyl, 3 -methyl butyl, and n-hexyl 
and its isomers. 
30 Examples of cycloalkyl groups are those having 

from 3 to 10 ring atoms, particular examples including 
those derived from cyclopropane, cyclobutane, 
cyclopentane, cyclohexane and cycloheptane , 
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bicycloheptane and decalin. 

Where alkyl (including cycloalkyl) groups are 
substituted, particularly where these form either both 
of the R' groups of the molecules of the invention, 
5 examples of appropriate substituents include halogen 
substituents or functional groups such as hydroxyl, 
amino, cyano, nitro, carboxyl and the like. Such 
groups may also be substituents, where appropriate, of 
the other R' groups in the molecules of the invention. 
10 The term amino refers to groups of type NR*R** , 

wherein R* and R** are independently selected from 
hydrogen, a Ci_ 6 alkyl group (also referred to as Ci- 6 
alkylamino or di-Ci- 6 alkylamino) . 

The term "halogen" as used herein includes 
15 fluorine, chlorine, bromine and iodine. 

The nucleotide molecules of the present invention 
are suitable for use in many different methods where 
the detection of nucleotides is required. 

DNA sequencing methods, such as those outlined in 
20 U.S. Pat. No. 5,302,50 9 can be carried out using the 
nucleotides. 

The present invention can make use of 
conventional detectable labels. Detection can be 
carried out by any suitable method, including 
25 fluorescence spectroscopy or by other optical means. 
The preferred label is a fluorophore, which, after 
absorption of energy, emits radiation at a defined 
wavelength. Many suitable fluorescent labels are 
known. For example, Welch et al . (Che/n. Eur. J. 
30 5 (3) : 951-960, 1999) discloses dansyl-functionalised 
fluorescent moieties that can be used in the present 
invention. Zhu et al . (Cytometry 28:206-211, 1997) 
describes the use of the fluorescent labels Cy3 and 
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Cy5, which can also be used in the present invention. 
Labels suitable for use are also disclosed in Prober 
et al. (Science 238:336-341, 1987); Connell et al. 
(BioTechniques 5 (4 ) : 342 -384 , 1987), Ansorge et al . 
5 (Nucl. Acids Res. 15 (11) :4593-4602, 1987) and Smith et 

al. (Nature 321:674, 1986). Other commercially 
available fluorescent labels include, but are not 
limited to, fluorescein, rhodamine (including TMR, 
texas red and Rox)., alexa, bodipy, acridine, coumarin, 
10 pyrene, benzanthracene and the cyanins. 

Multiple labels can also be used in the 
invention. For example, bi-f luorophore FRET cassettes 
(Tet. Let. 46:8867-8871, 2000) are well known in the 
art and can be utilised in the present invention. 
15 Multi-fluor dendrimeric systems (J. Amer. Chem. Soc . 
123:8101-8108, 2001) can also be used. 

Although fluorescent labels are preferred, other 
forms of detectable labels will be apparent as useful 
to those of ordinary skill. For example, 
20 microparticles, including quantum dots (Empodocles et 
al. t Nature 399:126-130, 1999), gold nanoparticles 
(Reichert et al., Anal. Chem. 72:6025-6029, 2000) and 
microbeads (Lacoste et al . , Proc. Natl. Acad. Sci USA 
97 (17) :9461-9466, 2000) can all be used. 
25 Multi -component labels can also be used in the 

invention. A multi -component label is one which is 
dependent on the interaction with a further compound 
for detection. The most common multi -component label 
used in biology is the biotin-streptavidin system. 
30 Biotin is used as the label attached to the nucleotide 
base. Streptavidin is then added separately to enable 
detection to occur. Other multi -component systems are 
available. For example, dinitrophenol has a 
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commercially available fluorescent antibody that can 
be used for detection. 

The invention has been and will be further 
described with reference to nucleotides. However, 
5 unless indicated otherwise, the reference to 

nucleotides is also intended to be applicable to 
nucleosides. The invention will also be further 
described with reference to DNA, although the 
description will also be applicable to RNA, PNA, and 
10 other nucleic acids, unless otherwise indicated. 

The modified nucleotides of the 'invention may use 
a cleavable linker to attach the label to the 
nucleotide. The use of a cleavable linker ensures 
that the label can, if required, be removed after 
15 detection, avoiding any interfering signal with any 
labelled nucleotide incorporated subsequently. 

Generally, the use of cleavable linkers is 
preferable, particularly in the methods of the 
invention hereinbefore described except where the 
20 detectable label is attached to the nucleotide by 
forming part of the "Z" group. 

Those skilled in the art will be aware of the 
utility of dideoxynucleoside triphosphates in so- 
called Sanger sequencing methods, and related 
25 protocols (Sanger-type) , which rely upon randomised 

chain-termination at a particular type of nucleotide. 
An example of a Sanger-type sequencing protocol is the 
BASS method described by Metzker (infra) . Other 
Sanger-type sequencing methods will be known to those 
30 skilled in the art. 

Sanger and Sanger-type methods generally 
operate by the conducting of an experiment in which 
eight types of nucleotides are provided, four of which 
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contain a 3'OH group; and four of which omit the OH 
group and which are labeled differently from each 
other. The nucleotides used which omit the 3 'OH group 
- dideoxy nucleotides - are convent ially abbreviated 
5 to ddNTPs. As is known by the skilled person, since 

the ddNTPs are labeled differently, by determining the 
positions of the terminal nucleotides incorporated, 
and combining this information, the sequence of the 
target oligonucleotide may be determined. 
10 The nucleotides of the present invention, it will 

be recognized, may be of utility in Sanger methods and 
related protocols since the same effect achieved by 
using ddNTPs may be achieved by using the novel 3 ■ -OH 
blocking groups described herein: both prevent 
15 incorporation of subsequent nucleotides. 

The use of the nucleotides according to the 
present invention in Sanger and Sanger- type sequencing 
methods, wherein the linker connecting the detectable 
label to the nucleotide may or may not be cleavable, 
20 forms a still further aspect of this invention. 

Viewed from this aspect, the invention provides the 
use of such nucleotides in a Sanger or a Sanger- type 
sequencing method. 

Where 3 1 -OH Z-blocked nucleotides according to 
25 the present invention are used, it will be appreciated 
that the detectable labels attached to the nucleotides 
need not be connected via cleavable linkers, since in 
each instance where a labelled nucleotide of the 
invention is incorporated, no nucleotides need to be 
30 subsequently incorporated and thus the label need not 
be removed from the nucleotide. 

Moreover, it will be appreciated that monitoring 
of the incorporation of 3 1 OH blocked nucleotides may 
be determined by use of radioactive 32 P in the 
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phosphate groups attached. These may be present in 
either the ddNTPs themselves or in the primers used 
for extension. Where the blocking groups are of 
formula "Z" , this represents a further aspect of the 
5 invention. 

Viewed from this aspect, the invention provides 
the use of a nucleotide having a 3'OH group blocked 
with a "Z" group in a Sanger or a Sanger- type 
sequencing method. In this embodiment, a 32 P 
10 detectable label may be present in either the ddNTPs 
used in the primer used for extension. 

Cleavable linkers are known in the art, and 
conventional chemistry can be applied to attach a 
linker to a nucleotide base and a label. The linker 
15 can be cleaved by any suitable method, including 

exposure to acids, bases, nucleophiles , elect rophiles, 
radicals, metals, reducing or oxidising agents, light, 
temperature, enzymes etc. The linker as discussed 
herein may also be cleaved with the same catalyst used 
20 to cleave the 3'0-blocking group bond. Suitable 

linkers can be adapted from standard chemical blocking 
groups, as disclosed in Greene & Wuts, Protective 
Groups in Organic Synthesis, John Wiley & Sons. 
Further suitable cleavable linkers used in solid-phase 
25 synthesis are disclosed in Guillier et al . (Chem. Rev. 
100:2092-2157, 2000) . 

The use of the term "cleavable linker" is not 
meant to imply that the whole linker is required to be 
removed from e.g., the nucleotide base. Where the 
30 detectable label is attached to the base, the 

nucleoside cleavage site can be located at a position 
on the linker that ensures that part of the linker 
remains attached to the nucleotide base after 
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Where the detectable label is attached to the 
base, the linker can be attached at any position on 
the nucleotide base provided that Watson-Crick base 
5 pairing can still be carried out. In the context of 
purine bases, it is preferred if the linker is 
attached via the 7 -position of the purine or the 
preferred deazapurine analogue, via an 8 -modified 
purine, via an N-6 modified adenosine or an N-2 
10 modified guanine. For pyrimidines, attachment is 

preferably via the 5 -position on cytosine, thymidine 
or uracil and the N-4 position on cytosine. Suitable 
nucleotide structures are shown in Fig. 1. For each 
structure in Fig. 1 X can be H, phosphate, diphosphate 
15 or triphosphate. R x and R 2 can be the same or 

different, and are selected from H, OH, O-allyl, or 
formula Z as described herein or any other group which 
can be transformed into an OH, including, but not 
limited to, a carbonyl, provided that at least one of 
20 Ri and R 2 is O-allyl or formula 2 as described herein. 
Some suitable functional groups for R x and R 2 include 
the structures shown in Figs. 3 and 4. 

Suitable linkers are shown in Fig. 3 and include, 
but are not limited to, disulfide linkers (1) , acid 
25 labile linkers (2, 3, 4 and 5; including 

dialkoxybenzyl linkers (e.g., 2), Sieber linkers 
(e.g., 3), indole linkers (e.g., 4), t-butyl Sieber 
linkers (e.g. , 5) ) , electrophilically cleavable 
linkers, nucleophilically cleavable linkers, 
30 photocleavable linkers, cleavage under reductive 

conditions, oxidative conditions, cleavage via use of 
safety-catch linkers, and cleavage by elimination 
mechanisms . 
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A. Electrophilically cleaved linkers. 

Electrophilically cleaved linkers are typically 
cleaved by protons and include cleavages sensitive to 
acids. Suitable linkers include the modified benzylic 
5 systems such as trityl, p-alkoxybenzyl esters and p- 
alkoxybenzyl amides. Other suitable linkers include 
tert -butyl oxycarbonyl (Boc) groups and the acetal 
system. 

The use of thiophilic metals, such as nickel, 
10 silver or mercury, in the cleavage of thioacetal or 

other sulfur- containing protecting groups can also be 
considered for the preparation of suitable linker 
molecules . 

15 B. Nucleophilically cleaved linkers. 

Nucleophilic cleavage is also a well recognised 
method in the preparation of linker molecules. Groups 
such as esters that are labile in water (i.e., can be 
cleaved simply at basic pH) and groups that are labile 

20 to non-aqueous nucleophiles , can be used. Fluoride 
ions can be used to cleave silicon-oxygen bonds in 
groups such as triisopropyl silane (TIPS) or 
t-butyldimethyl silane (TBDMS) . 

25 C. Photocleavable linkers. 

Photocleavable linkers have been used widely in 
carbohydrate chemistry. It is preferable that the 
light required to activate cleavage does not affect 
the other components of the modified nucleotides. For 

30 example, if a fluorophore is used as the label, it is 
preferable if this absorbs light of a different 
wavelength to that required to cleave the linker 
molecule. Suitable linkers include those based on O- 
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nitrobenzyl compounds and nitroveratryl compounds. 
Linkers based on benzoin chemistry can also be used 
(Lee et al., J. Org. Chem. 64:3454-3460, 1999). 

5 D. Cleavage under reductive conditions 

There are many linkers known that are susceptible 
to reductive cleavage. Catalytic hydrogenation using 
palladium-based catalysts has been used to cleave 
benzyl and benzyloxycarbonyl groups. Disulfide bond 
10 reduction is also known in the art. 

E. Cleavage under oxidative conditions 

Oxidation-based approaches are well known in the 
art. These include oxidation of p-alkoxybenzyl groups 
15 and the oxidation of sulfur and selenium linkers. The 
use of aqueous iodine to cleave disulfides and other 
sulfur or selenium-based linkers is also within the 
scope of the invention. 

20 F. Safety-catch linkers 

Safety-catch linkers are those that cleave in two 
steps. In a preferred system the first step is the 
generation of a reactive nucleophilic center followed 
by a second step involving an intra-molecular 

25 cyclization that results in cleavage. For example, 

levulinic ester linkages can be treated with hydrazine 
or photochemistry to release an active amine, which 
can then be cyclised to cleave an ester elsewhere in 
the molecule (Burgess et al . , J. Org. Chem. 62:5165- 

30 5168, 1997) . 

G. Cleavage by elimination mechanisms 

Elimination reactions can also be used. For 
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example, the base-catalysed elimination of groups such 
as Fmoc and cyanoethyl , and palladium-catalysed 
reductive elimination of allylic systems, can be used. 

5 As well as the cleavage site, the linker can 

comprise a spacer unit. The spacer distances e.g., 
the nucleotide base from the cleavage site or label. 
The length of the linker is unimportant provided that 
the label is held a sufficient distance from the 
10 nucleotide so as not to interfere with any interaction 
between the nucleotide and an enzyme/ 

In a preferred embodiment the linker may consist 
of the same functionality as the block. This will 
make the deprotection and deblocking process more 
15 efficient, as only a single treatment will be required 
to remove both the label and the block. 

Particularly preferred linkers are phosphine- 
cleavable azide containing linkers. 

A method for determining the sequence of a target 
20 polynucleotide can be carried out by contacting the 
target polynucleotide separately with the different 
nucleotides to form the complement to that of the 
target polynucleotide, and detecting the incorporation 
of the nucleotides. Such a method makes use of 
25 polymerisation, whereby a polymerase enzyme extends 

the complementary strand by incorporating the correct 
nucleotide complementary to that on the target. The 
polymerisation reaction also requires a specific 
primer to initiate polymerisation. 
30 For each cycle, the incorporation of the modified 

nucleotide is carried out by the polymerase enzyme, 
and the incorporation event is then determined. Many 
different polymerase enzymes exist, and it will be 
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evident to the person of ordinary skill which is most 
appropriate to use. Preferred enzymes include DNA 
polymerase I, the Klenow fragment, DNA polymerase III, 
T4 or T7 DNA polymerase, Taq polymerase or Vent 
5 polymerase. Polymerases engineered to have specific 
properties can also be used. As noted earlier, the 
molecule is preferably incorporated by a polymerase 
and particularly from Thermococcus sp. , such as 9°N. 
Even more preferably, the polymerase is a mutant 9°N 
10 A485L and even more preferably is a double mutant * 
Y409V and A485L. An example of one such preferred 
enzyme is Thermococcus sp. 9°N exo -Y409V A485L 
available from New England Biolabs. Examples of such 
appropriate polymerases are disclosed in Proc. Natl. 
15 Acad. Sci. USA, 1996(93), pp 5281-5285, Nucleic Acids 
Research, 1999 (27), pp 2454-2553 and Acids Research, 
2002 (30) , pp 605-613. 

The sequencing methods are preferably carried out 
with the target polynucleotide arrayed on a solid 
20 support. Multiple target polynucleotides can be 
immobilised on the solid support through linker 
molecules, or can be attached to particles, e.g., 
microspheres, which can also be attached to a solid 
support material. The polynucleotides can be 
25 attached to the solid support by a number of means, 
including the use of biotin-avidin interactions. 
Methods for immobilizing polynucleotides on a solid 
support are well known in the art, and include 
lithographic techniques and "spotting" individual 
30 polynucleotides in defined positions on a solid 

support. Suitable solid supports are known in the 
art, and include glass slides and beads, ceramic and 
silicon surfaces and plastic materials. The support 
is usually a flat surface although microscopic beads 
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(microspheres) can also be used and can in turn be 
attached to another solid support by known means. The 
microspheres can be of any suitable size, typically in 
the range of from 10 nm to 100 nm in diameter. In a 
5 preferred embodiment, the polynucleotides are attached 
directly onto a planar surface, preferably a planar 
glass surface. Attachment will preferably be by means 
of a covalent linkage. Preferably, the arrays that 
are used are single molecule arrays that comprise 
10 polynucleotides in distinct optically resolvable 

areas, e.g., as disclosed in International Application 
No. WO00/06770. 

The sequencing method can be carried out on both 
single polynucleotide molecule and multi- 
15 polynucleotide molecule arrays, i.e., arrays of 
distinct individual polynucleotide molecules and 
arrays of distinct regions comprising multiple copies 
of one individual polynucleotide molecule. Single 
molecule arrays allow each individual polynucleotide 
20 to be resolved separately. The use of single molecule 
arrays is preferred. Sequencing single molecule 
arrays non-destructively allows a spatially 
addressable array to be formed. 

The method makes use of the polymerisation 
25 reaction to generate the complementary sequence of the 
target. Conditions compatible with polymerization 
reactions will be apparent to the skilled person. 

To carry out the polymerase reaction it will 
usually be necessary to first anneal a primer sequence 
30 to the target polynucleotide, the primer sequence 

being recognised by the polymerase enzyme and acting 
as an initiation site for the subsequent extension of 
the complementary strand. The primer sequence may be 
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added as a separate component with respect to the 
target polynucleotide. Alternatively, the primer and 
the target polynucleotide may each be part of one 
single stranded molecule, with the primer portion 
5 forming an intramolecular duplex with a part of the 
target, i.e., a hairpin loop structure. This 
structure may be immobilised to the solid support at 
any point on the molecule. Other conditions necessary 
for carrying out the polymerase reaction, including 

10 temperature, pH, buffer compositions etc., will be 
apparent to those skilled in the art.' 

The modified nucleotides of the invention are 
then brought into contact with the target 
polynucleotide, to allow polymerisation to occur. The 

15 nucleotides may be added sequentially, i.e., separate 
addition of each nucleotide type (A, T, G or C) , or 
added together. If they are added together, it is 
preferable for each nucleotide type to be labelled 
with a different label. 

20 This polymerisation step is allowed to proceed 

for a time sufficient to allow incorporation of a 
nucleotide. 

Nucleotides that are not incorporated are then 
removed, for example, by subjecting the array to a 

25 washing step, and detection of the incorporated labels 
may then be carried out . 

Detection may be by conventional means, for 
example if the label is a fluorescent moiety, 
detection of an incorporated base may be carried out 

30 by using a confocal scanning microscope to scan the 
surface of the array with a laser, to image a 
fluorophore bound directly to the incorporated base. 
Alternatively, a sensitive 2-D detector, such as a 
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charge -coupled detector (CCD) , can be used to 
visualise the individual signals generated. However, 
other techniques such as scanning near- field optical 
microscopy (SNOM) are available and may be used when 
5 imaging dense arrays. For example/ using SNOM, 

individual polynucleotides may be distinguished when 
separated by a distance of less than 100 nm, e.g., 10 
nm to 10 /zm. For a description of scanning near-field 
optical microscopy, see Moyer et al., Laser Focus 
10 World 29:10, 1993. Suitable apparatus used for 
imaging polynucleotide arrays are known and the 
technical set-up will be apparent to the skilled 
person. 

After detection, the label may be removed using 
15 suitable conditions that cleave the linker and the 
v 3'OH block to allow for incorporation of further 
modified nucleotides of the invention. Appropriate 
conditions may be those described herein for allyl 
group and for W Z" group deprotections . These 
20 conditions can serve to deprotect both the linker (if 
cleavable) and the blocking group. Alternatively, the 
linker may be deprotected separately from the allyl 
group by employing methods of cleaving the linker 
known in the art (which do not sever the O-blocking 
25 group bond) followed by deprotect ion. 

This invention may be further understood with 
reference to the following examples which serve to 
illustrate the invention and not to limit its scope. 
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3 1 -OH protected with an azidomethyl group as a 
protected form of a hemiaminal : 

I 

5 

Nucleotides bearing this blocking group at the 
3 ! position have been synthesised, shown to be 
successfully incorporated by DNA polymerases, block 
efficiently and may be subsequently removed under 
10 neutral, aqueous conditions using water soluble 
phosphines or thiols allowing further extension: 




15 5- [3- (2,2, 2-trifluoroacetamido) -prop- 1-ynyl] -2' - 
deoxyuridine (1) . 

To a solution of 5 -iodo- 2 ' -deoxyuridine (1.05 g, 
2,96 mmol) and Cul (114 mg, 0.60 mmol) in dry DMF 

20 (21 ml) was added triethylamine (0.9 ml). After 

stirring for 5 min trif luoro-N-prop-2-ynyl-acetamide 
(1.35 g, 9.0 mmol) and Pd(PPh 3 ) 4 (330 mg, 0.29 mmol) 
were added to the mixture and the reaction was stirred 
at room temperature in the dark for 16 h. Metanol 

25 (MeOH) (4 0 ml) and bicarbonate dowex added to the 

reaction mixture and stirred for 4 5 min. The mixture 
was filtered and the filtrate washed with MeOH and the 
solvent was removed under vacuum. The crude mixture 
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was purified by chromatography on silica (ethyl 
acetate (EtOAc) to EtOAc :MeOH 95:5) to give slightly 
yellow crystals (794 mg, 71 %) . X H NMR (d 6 
dimethylsulf oxide (DMSO) ) 6 2.13-2,17 (m, 2H, H-2'), 
5 3.57-3.65 (m, 2H, H-5'), 3.81-3.84 (m, 1H, H-4'), 

4.23-4.27 (m, 3H, H-3', CH 2 N) , 5.13 (t, J = 5.0 Hz, 
1H, OH) , 5.20 (d, J = 4.3 Hz, 1H, OH) , 6.13 (t, J" = 
6.7Hz, 1H, H-l'), 8.23 (s, 1H, H-6) , 10.11 (t, 
J = 5.6 Hz, 1H, NH) , 11.70 (br s, 1H, NH) . Mass (-ve 
10 electrospray) calcd for Ci4H 14 F3N 3 0 6 377.08, found 3 76. 




15 5' -O- (tert-butydimethylsilyl) -5- [3- (2,2,2- 

trif luoroacetamido) -prop-l-ynyl] -2 ' -deoxyuridine (2) . 

To a solution of (1) (656 mg, 1.74 mmol) in dry DMF 
(15 ml) was added t-butyldimethylsilyl chloride 

20 (288 mg, 1.91 mmol) in small portions, followed by 
imidazole (130 mg, 1.91 mmol). The reaction was 
followed by TLC and was completed after stirring for 
8 h at room temperature. The reaction was quenched 
with sat. aq. NaCl solution. EtOAc (2 5 ml) was added 

25 to the reaction mixture and the aqueous layer was 
extracted with EtOAc three times. After drying the 
combined organics (MgSQ 4 ) , the solvent was removed 
under vacuum. Purification by chromatography on silica 
(EtOAc : petroleum ether 8:2) gave (2) as slightly 

30 yellow crystals (676 mg, 83 %) . X H NMR (d 6 DMSO) 6 0.00 
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(s, 6H, CH 3 ) , 0.79 (s, 9H, tBu) , 1,93 -2.00 (m, 1H, H- 
2'), 2.06- 2.11 (m, 1H, H-2/), 3.63-3.75 (m, 2H, H- 
5'), 3.79-3.80 (m, 1H, H-4'), 4.12-4.14 (m, 3H, H-3', 
CH 2 N) , 5.22 (d, J = 4.1 Hz, 1H # OH), 6.03 (t, 
J = 6.9 Hz, 1H, H-l'), 7.86 (s, 1H, H-6) , 9.95 (t, J = 
5.4 Hz, 1H, NH) , 11.61 <br s, 1H, NH) . Mass ( - ve 
electrospray) calcd for C 2 qH 2B F 2 N 3 0 6 S± 491.17, found 
490. 




5' -O- (tert-Butydimethylsilyl) -3 ' -O-methylthiomethyl-5- 
[3- (2,2,2-trif luoroacetamido) -prop- 1-ynyl] -2 ' - 
15 deoxyuridine (3) . 

To a solution of (2) (1.84 g, 3.7 mmol) in dry DMSO 
(7 ml) was added acetic acid (3.2 ml) and acetic 
anhydride (10.2 ml). The mixture was stirred for 2 

20 days at room temperature, before it was quenched with 
sat. aq. NaHC0 3 . EtOAc (50 ml) was added and the 
aqueous layer was extracted three times with ethyl 
acetate. The combined organic layers were washed with 
sat. aq. NaHC0 3 solution and dried (MgSO<) . After 

25 removing the solvent under reduced pressure, the 

product (3) was purified by chromatography on silica 
(EtOAc : petroleum ether 8:2) yielding a clear sticky 
oil (1.83 g, 89 %) . X H NMR (d 6 DMSO) : 5 0.00 (s, 6H, 
CH 3 ) , 0.79 (s, 9H, tBu), 1.96-2.06 (m, 1H, H-2'), 1.99 



WO 2004/018497 



PCT/GB2003/003686 



- 46 - 

(s, 3H, SCH 3 ) , 2.20-2.26 (m, 1H, H-2'), 3.63-3.74 (m, 
2H, H-5'), 3.92-3.95 (m # 1H, H-4'), 4.11-4.13 (m, 2H, 
CH 2 ) , 4.28-4.30 (m, 1H, H-3'), 4.59 (br s, 2H, CH 2 ) , 
5.97 (t, J m 6.9 Hz, 1H, H-l'), 7.85 (s, 1H, H-6) f 
5 9.95 (t, J - 5.3 Hz, 1H, NH) , 11.64 (s, 1H, NH) . Mass 
(-ve electrospray) calcd for C 2 2H32F3N 3 0 6 SSi 551.17, 
found 550. 




3' -O-Azidoxnethyl-5- [3- (2 r 2 , 2-trif luoroacetamido) -prop- 
1-ynyl] -2 # -deoxyuridine (4) . 

15 To a solution of (3) (348 mg, 0.63 mmol) and 

cyclohexene (0.32 ml # 3.2 mmol) in dry CH 2 C1 2 (5 ml) 
at 4 °C # sulfurylchoride (1M in CH 2 C1 2/ 0.76 ml, 0.76 
mmol) was added drop wise under N 2 . After 10 min TLC 
indicated the full consumption of the nucleoside (3) . 

20 The solvent was evaporated and the residue was 

subjected to high vacuum for 2 0 min. It was then 
redissolved in dry DMF (3 ml) and treated with NaN 3 
(205 mg, 3.15 mmol). The resulting suspension was 
stirred under room temperature for 2h. The reaction 

25 was quenched with CH 2 C1 2 and the organic layers were 

washed with sat aq. NaCl solution. After removing the 
solvent, the resulting yellow gum was redissolved in 
THF (2 ml) and treated with TBAF (1 M in THF, 0.5 ml) 
at room temperature for 30 min. The solvent was 
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removed and the reaction worked up with CH 2 C1 2 and 
sat. aq. NaHC0 3 solution. The aqueous layer was 
extracted three times with CH 2 C1 2 . Purification by 
chromatography on silica (EtOAc : petroleum ether 1:1 
5 to EtOAc) gave (4) (100 mg, 37 %) as a pale yellow 

foam. X H NMR (d 6 DMSO) 6 2.15-2.26 (m, 2H, H-2'), 3.47- 
3.57 (m, 2H, H-5'), 3.88-3.90 (m, 1H, H-4'), 4.14 (d, 
J = 4.7 Hz, 2H, CH 2 NH) , 4.24-4.27 (m, 1H, H-3'), 4.75 
(s, 2H, CH 2 N 3 ) , 5.14 (t, <J = 5.2 Hz, 1H, OH), 5.96-. 
10 6.00 (m, 1H, H-l'), 8.10 (s, 1H, H-6) , 10.00 (s, 1H, 
NHCOCF3) ) , 11.26 (s, 1H, NH) . 




15 Preparation of bis ( tri-n-butylammonium) pyrophosphate 
(0.5 M solution in DMF) 

Tetrasodium diphosphate decahydrate (1.5 g, 3.4 mmol) 
was dissolved in water (34 ml) and the solution was 

20 applied to a column of dowex in the H* form. The 
column was eluted with water. The eluent dropped 
directly into a cooled (ice bath) and stirred solution 
of tri-n-butylamine (1.6 ml, 6.8 mmol) in EtOH (14 
ml) . The column was washed until the pH of the eluent 

25 increased to 6. The aq. ethanol solution was 

evaporated to dryness and then co-evaporated twice 
with ethanol and twice with anhydrous DMF. The residue 
was dissolved in DMF (6.7 ml) . The pale yellow 
solution was stored over 4A molecular sieves. 
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3' -O-Azidomethyl-5- (3 -amino -prop- 1-ynyl) -2' - 
deoxyuridine 5 ' -O-nucleoside triphosphate (5). 

5 The nucleoside (4) and proton sponge was dried over 

P 2 0 5 under vacuum overnight. A solution of (4) (92 mg, 
0.21 mmol) and proton sponge (90 mg, 0.42 mmol) in 
trimethylphosphate (0.5 ml) was stirred with 4A 
molecular sieves for 1 h. Freshly distilled POCl 3 (24 
10 0.2 6 mmol) was added and the solution was stirred 

at 4°C for 2h. The mixture was slowly warmed up to 
room temperature and bis (tri-n-butyl ammonium) 
pyrophosphate (1.7 ml, 0.85 mmol) and anhydrous tri-n- 
butyl amine (0.4 ml, 1.7 mmol) was added. After 3 min, 
15 the reaction was quenched with 0.1 M TEAB 

(triethylammonium bicarbonate) buffer (15 ml) and 
stirred for 3h. The water was removed under reduced 
pressure and the resulting residue dissolved in 
concentrated ammonia (p 0.88, 15 ml) and stirred at 
20 room temperature for 16 h. The reaction mixture was 

then evaporated to dryness. The residue was dissolved 
in water and the solution applied to a DEAE-Sephadex 
A-2 5 column. MPLC was performed with a linear gradient 
of TEAB. The triphosphate was eluted between 0.7 M and 
25 0.8 M buffer. Fractions containing the product were 
combined and evaporated to dryness. The residue was 
dissolved in water and further purified by HPLC. HPLC: 
t r (5) : 18.8 min (Zorbax C18 preparative column, 
gradient: 5% to 35% B in 30 min, buffer A 0 . 1M TEAB, 
30 buffer B MeCN) The product was isolated as a white 
foam (76 O.D., 7.6 jamo! , 3.8%, e 2a o = 10000). X H NMR 
(D z O) 8 1.79 (s, CH 2 ), 2.23-2.30; 2.44-2.50 (2 x m, 2H, 
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H-2'), 3-85 (m, CH 2 NH) , 4.10-4.18 (m, 2H, H-5'), 4,27 
(br s, H-4'), 4.48-4.50 (m, H-3'), 4.70-4.77 (m, 
CH 2 N 3 ) , 6.21 (t, J = 6.6 Hz ; H-l'), 8.32 (s, 1H, H-6) . 
31 P NMR (D 2 0) 8 -6.6 (m, IP, P y ) , -10.3 (d, .7=18.4 Hz, 
5 IP, P a ), -21.1 (m, IP, Pp) . Mass (-ve electrospray) 
calcd for C 13 H 19 N 6 0 14 P 3 576.02, found 575. 




10 Cy-3disulfide linker. 

The starting disulfide (4.0 mg, 13.1 |imol) was 
dissolved in DMF (300 jiL) and diisopropylethylamine (4 
|iL) was slowly added. The mixture was stirred at room 
temperature and a solution of Cy-3 dye (5 mg, 6.53 

15 jimol) in DMF (3 00 ^L) was added over 10 min. After 3.5 
h, on complete reaction, the volatiles were evaporated 
under reduced pressure and the crude residue was HPLC 
purified on a Zorbax analytical column SB-C18 with a 
flow rate of lml/min in 0 . 1M triethylammonium 

20 bicarbonate buffer (buffer A) and CH 3 CN (buffer B) 
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using the following gradient :. 5 min 2% B;.31min 55% B; 
33 min 95% B;.37 min 95%;. 39 min 2% B;.44 min. 2% B. 
The expected Cy3-disulf ide linker was eluted with a 
t r : 21.8 min. in 70% yield (based on a UV measurement; 
5 e S5 o 150, 000 cm" 1 M' 1 in H 2 0) as a hygroscopic solid. X H 
NMR (D 2 0) 5 1.31-1.20 (m + t, J = 7.2 Hz, 5H, CH 2 + 
CH 3 ), 1.56-1.47 (m, 2H, CH 2 ) , 1.67 (s, 12H, 4 CH 3 ) , 
1.79-1.74 (m, 2H, CH 2 ) , 2.11 (t, J = 6.9 Hz, 2H, CH 2 ) , 
2.37 (t, J = 6.9 Hz, 2H, CH 2 ) , 2.60 (t, J « 6.3 Hz, 

10 2H, CH 2 ) , 2.67 (t, J - 6.9 Hz, 2H, CH 2 ) , 3.27 (t, J = 
6.1 Hz, 2H, CH 2 ) , 4.10-4.00 (m, 4H, 2CH 2 ) , 6.29 (dd, J 
=13.1, 8.1 Hz, 2H, 2 =CH) , 7.29 (dd, 2H, J= 8.4, 6.1 
Hz, 2 =CH) , 7.75-7.71 (m, 2H, 2 =CH) , 7.78 (s, 2H, 
=CH) , 8.42 (t, J =12.8 Hz, 1H, =CH) . Mass (-ve 

15 electrospray) calcd for C3SH47N3O9S4 793.22, found 792 
(M-H) , 396 [M/2] . 
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A mixture of Cy3 disulphide linker (2.5 ^mol) , 
5 disuccinimidyl carbonate (0,96 mg ; 3.75 ^mol) and DMAP 
(0,46 mg, 3.75 ^imol) were dissolved in dry DMF (0.5 
ml) and stirred at room temperature for 10 min. The 
reaction was monitored by TLC (MeOH:CH 2 Cl 2 3:7) until 
all the dye linker was consumed. Then a solution of 
10 (5) (7.5 |imol) and n-Bu 3 N (30 nl , 125 ^mol) in DMF 

(0.2 ml) was added to the reaction mixture and stirred 
at room temperature for 1 h. TLC (MeOH:CH 2 Cl 2 4:6) 
showed complete consumption of the activated ester and 
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a dark red spot appeared on the baseline. The reaction 
was quenched with TEAB buffer (0.1M, 10 ml) and loaded 
on a DEAE Sephadex column (2x5 cm) . The column was 
first eluted with 0.1 M TEAB buffer (100 ml) to wash 
5 off organic residues and then 1 M TEAB buffer (100 

ml). The desired triphosphate analogue (6) was eluted 
out with 1 M TEAB buffer. The fraction containing the 
product were combined, evaporated and purified by 
HPLC. HPLC conditions: t r (6): 16.1 min (Zorbax C18 
10 preparative column, gradient: 2% to 55% B in 30 min, 
buffer A 0.1M TEAB, buffer B MeCN) . The product was 
isolated as dark red solid (1.35 jxmol, 54%, e 550 = 
150000). X H NMR (D 2 0) 5 1.17-1.28 (m, 6H 3 x CH 2 ) , 
1.41-1.48 (m, 3 H, CH 3 ) , 1.64 (s, 12H, 4 x CH 3 ) , 1.68- 

15 1.71 (m, 2H, CH 2 ) , 2.07-2.10 (m, 3H, H-2', CH 2 ) , 2.31- 

2.35 (m, 1H, H-2'), 2.50-2.54 (m, 2H, CH 2 ) , 2.65 (t, J 
= 5.9 Hz, 2H, CH 2 ) , 2.76 (t, J = 7.0 Hz, 2H, CH 2 ) , 
3.26-3.31 (m, 2H, CH 2 ) , 3.88-3.91 (m, 2H CH 2 ) , 3.94- 
.4.06 (m, 3H, CH 2 N, H-5'), 4.16 (br s, 1H, H-4'), 4.42- 

20 4.43 (m, 1H, H-3'), 4.72-4.78 (m, 2H, CH 2 N 3 ) , 6.24 

(dd, J = 5.8, 8.2Hz, H-l') , 6.25 (dd, J = 3.5, 8.5 
Hz, 2H, H^), 7.24, 7.25 (2d, J = 14.8 Hz, 2 x =CH) , 
7.69-7.86 (m, 4H, H Ar , H-6) , 8.42 (t, J =13.4 Hz , 
=CH) . 31 P NMR (D 2 0) 8 -4.85 (m, IP, P Y ) , -9.86 (m, IP, 

25 P a )/ -20.40 (m, IP, P p ) . Mass (-ve electrospray) calcd 
for C 4 9H 6 4N90 22 P 3 S4 1351.23, found 1372 (M-2H+Na) , 1270 
[M-80] , 1190 [M-160] . 
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OH OH (7) 

5- [3- (2 , 2 , 2-Trif luoroacetamido) -prop- 1-ynyl] -2'- 
deoxycytidine (7) . 

5 

To a solution of 5-iodo-2 ' -deoxycytidine (10 g, 28.32 
mmol) in DMF (200 ml) in a light protected round 
bottom flask under Argon atmosphere, was added Cul 
(1.08 g, 5.67 mmol), triethylamine (7.80 ml, 55.60 

10 mmol) , 2,2,2-trifluoro-N-prop-2-ynyl-acetamide (12.8 
g, 84.76 mmol) and at last Pd(PPh) 3 )4 (3.27 g, 2.83 
mmol) . After 18 hours at room temperature, dowex 
bicarbonate (2 0 mg) was added and the mixture was 
stirred for a further 1 h. Filtration and evaporation 

15 of the volatiles under reduced pressure gave a residue 
that was purified by flash chromatography on silica 
gel (CH 2 C1 2 , CH 2 Cl 2 :EtOAc 1:1, EtOAc : MeOH 9:1). The 
expected product (7) was obtained as a beige solid in 
quantitative yield. X H NMR (D 2 0) 5 2.24-2.17 (m, 1H, 

20 H-2'), 2.41-2.37 (m, 1H, H-2'), 3.68 (dd, J = 12.5, 

5.0 Hz, 1H, H-5'), 3.77 (dd, J" = 12.5, 3.2 Hz, 1H, H- 
5'), 3.99 (m, 1H, H-4'), 4.27 (s, 2H, CH 2 N) , 4.34 (m, 
1H, H-3') , 6.11 (t, J" = 6.3 Hz, 1H, H-l') , 8.1 (br s, 
1H, NH) ; MS (ES) : m/z (%) (M-H) 375 (100). 



25 
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5' -O- (terfc-Butyldimethylsilyl) -5- [3- (2,2,2- 
5 trifluoroacetamido) -prop-l-ynyl] -2' -deoxycytidine (8) . 

To a solution of the starting material (7) (1.0 g, 
2.66 mmol) and imidazole (200 mg, 2.93 mmol) in DMF 
(3.0 ml) at 0 °C, was slowly added TBDMSC1 (442 mg, 

10 2.93 mmol) in four portions over 1 h. After 2 h, the 
volatiles were evaporated under reduced pressure and 
the residue was adsorbed on silica gel and purified by 
flash chromatography (EtOAc, EtOAc : MeOH 9.5:0.5). The 
expected product (8) was isolated as a crystalline 

15 solid (826 mg, 64%). X H NMR (d 6 DMSO) 8 0.00 (s, 1H, 

CH 3 ) ; 0.01 (s, 1H, CH 3 ) , 0.79 (s, 9 H, tBu) , 1.87-1.80 
(m, 1H, H-2'), 2.12 (ddd, J = 13.0, 5.8 and 3.0 Hz, 
1H, H-2'), 3.65 (dd, J = 11.5, 2.9 Hz, 1H, H-5'), 3.74 
(dd, J = 11.5, 2.5Hz, 1H, H-5'), 3.81-3.80 (m, 1H, H- 

20 4'), 4.10-4.09 (m, 1H, H-3'), 4.17 (d, 2H, J = 5.1 Hz, 
NCH 2 ) , 5.19 (d, 1H, J= 4.0 Hz, 3 ' -OH) , 6.04 (t, J = 
6.6 Hz, 1H, H-l'), 6.83 (br s, 1H, NHH) , 7.78 (br s, 
1H, NHH), 7.90 (s, 1H, H-6) , 9.86 <t, J = 5.1 Hz, 1H, 
-H 2 CNH) ; MS (ES) : m/z (%) (MH) * 491 (40%). 
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TBDMS* 




(9) 



4-N-Acetyl-5 ' -O- (tert-butyldlmethylsilyl ) -3' -O- 
5 (methyl thiolmethyl) -5- [3- (2, 2, 2-trif luoroacetamide) - 
prop-l-ynyl] -2' -deoxycytidine (9) . 

To a solution of the starting material (8) (825 mg, 
1.68 mmol) in DMSO (6.3 ml) and N 2 atmosphere, was 

10 slowly added acetic acid (AcOH) (1.3 ml, 23.60 mmol) 
followed by acetic anhydride (Ac 2 0) (4.8 ml, 50.50 
mmol) . The solution was stirred at room temperature 
for 18 h and quenched at 0 °C by addition of saturated 
NaHC0 3 (20 ml) . The product was extracted into EtOAc 

15 (3 x 30 ml) , organic extracts combined, dried (MgS0 4 ) , 
filtered and the volatiles evaporated. The crude 
residue was purified by flash chromatography on silica 
gel (EtOAc: petroleum ether 1:1) to give the expected 
product as a colourless oil (9) (573 mg, 62%) . *H NMR 

20 (d« DMSO) 8 0.00 (s, 6H, 2 x CH 3 ) , 0.78 (s, 9H, tBu) , 

2.01 (s, 3H, SCH 3 ) , 2.19-1.97 (m, 2H, 2 x H2 ' ) , 2.25 
(s, 3H, C0CH 3 ), 3.67 (dd, 1H, J = 11.5 Hz, H-5'), 3.78 
(dd, 1H, J" = 11.5, 3.3 Hz, H-5'), 4.06-4.05 (m, 1H, H- 
4'), 4.17 (d, 2H, J = 5.1 Hz, N-Ctf 2 ) , 4 .30-4.28 (m, 

25 1H, H-3'), 4.63 (s, 2H; Ctf 2 -S) , 5.94 (t, 1H, J = 6.5 

Hz, H-l'), 8.17 (s, 1H, H-6) , 9.32 (s, 1H, NHCO) , 9.91 
(t, 1H, J= 5.4 Hz, NHCH 2 ) ; MS (ES) : m/z (%) (MH) + 
593 . 
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TBOMSi 




(10) 

4 -tf- Acetyl- 3' -O- (azidomethyl) -5' -O- {tert- 
5 butyldimethylsilyl) -5- [3- (2 , 2 , 2- trif luoroacetamide) - 
prop-l-ynyl] -2' -deoxycytidine (10) . 

To a solution of the starting material (9) (470 mg, 
0.85 mmol) in dicloromethane (DCM) (8 ml) under N 2 

10 atmosphere and cooled to 0 °C, was added cyclohexene 
(430 /zl, 4.27 mmol) followed by S0 2 C1 2 (1 M in DCM, 
i.O ml, 1.02 mmol). The solution was stirred for 30 
minutes at 0 °C, and the volatiles were evaporated. 
Residue immediately dissolved in DMF (8 ml) stirred 

15 under N 2 and sodium azide (2 75 mg, 4.2 7 mmol) slowly 

added. After 18 h, the crude product was evaporated to 
dryness, dissolved in EtOAc (30 ml) and washed with 
Na 2 C0 3 (3 x 5 ml) . The combined organic layer was kept 
separately. A second extraction of the product from 

20 the aqueous layer was performed with DCM (3 x 10 ml) . 
All the combined organic layers were dried (MgS0 4 ) , 
filtered and the volatiles evaporated under reduced 
pressure to give an oil identified as the expected 
product (10) (471 mg, 94 % yield) . This was used 

25 without any further purification. X H NMR (d 6 DMSO) 5 
0.11 (s, 3H, CH 3 ) , 0.11 (s, 3H, CH 3 ) , 0.88 (s, 9H, 
c Bu) , 2.16-2.25 (m, 1H, H-2'), 2.35 (s, 3H, COCH 3 ) , 
2.47-2.58 (m, 1H, H-2'), 3.79 (dd, J « 11.6, 3.2 Hz, 
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1H, H-5'), 3.90 (dd, J" = 11.6, 3.0 Hz, 1H, H-5' ) , 
4.17-4.19 (m, 1H, H-4'), 4.28 (s, 2H, NCH 2 ) , 4.32-4.35 
(m, 1H, H-3'), 4.89 (dd, J" = 14.4, 6.0 Hz, 2H, CH 2 - 
N 3 ) , 6.05 (t, J = 6.4 Hz, 1H, H-l'), 8.25 (s, 1H, H- 
5 6), 9.46 (br s, 1H, NHH) , 10.01 (br s, 1H, NHH). 




(ID (12) 

10 

4-AT-Acetyl-3' -O- (azidomethyl) -5- [3- (2,2, 2- 
trifluoroacetamido) -prop-l-ynyl] -2' -deoxycytldine and 
3' -O- (Azidomethyl) -5- [3 - (2 , 2 , 2 - trif luoroace tamido) - 
prop-l-ynyl] -2' -deoxycytldine (11) . 

15 

To a solution of the starting material (11) (440 mg, 
0.75 mmol) in THF (20 ml) at 0 °C and N 2 atmosphere, 
was added TBAF in THF 1.0 M (0.82 ml, 0.82 mrnol) . 
After 1.5 h, the volatiles were evaporated under 

20 reduced pressure and the residue purified by flash 
chromatography on silica gel (EtOAc : petroleum ether 
8:2 to EtOAc 100 % to EtOAc : MeOH 8:2). Two compounds 
were isolated and identified as above described. The 
first eluted 4-N-Acetyl (11), (53 mg, 15 %) and, the 

25 second one 4-NH 2 (12) (271 mg, 84 %) . 

Compound 4-N-Acetyl (11): X H NMR (d 6 DMSO) 5 1.98 (s, 
3H, CH 3 CO) , 2.14-2.20 (m, 2H, HH-2'), 3.48-3.55 (m, 
1H, H-5'), 3.57-3.63 (m, 1H, H-5'), 3.96-4.00 (m, 1H, 
H-4'), 4.19 (d, «J = 5.3 Hz, 2H, CH 2 -NH) , 4.23-4.28 (m, 
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1H, H-3'), 4.77 (S,- 2H, CH 2 -N 3 ) , 5.2 (t,!H, cT= 5.1 
Hz, 5'-OH), 5.95 (t, J = 6.2 Hz, 1H, H-l'), 8.43 (s, 
1H, H-6) , 9.34 (s, 1H, CONtf) , 9.95 (t, J ' = 5.3 Hz, 1H, 
NHCH 2 ) . 

5 Compound 4-NH 2 (12): X H NMR (d 6 DMSO) 6 1 . 98-2 . 07 (2H, 

CHtf-2'), 3.50-3.63 (m, 2H, Ctftf-5'), 3.96-4.00 (m, 1H, 
H-4'), 4.09 (d, J= 5.3 Hz, 2H, CH 2 -NH) , 4.24-4.28 (m, 
1H, H-3'), 4.76 (s, 2H, CH 2 -N 3 ) , 5.13 (t , J= 5.3 Hz, 
1H, 5'-OH), 5.91 (br s, 1H, NHH) , 6.11 (t, J = 6.4 Hz, 
10 1H, H-l'), 8.20 (t, J" = 5.3 Hz, 1H, NCH 2 ) , 8.45 (s, 

1H, H-6), 11.04 (br s, 1H, NHH). 




(13) 

15 

4-N-Benzoyl-5' -O- ( tex-t-butyldimethylsilyl) -5- [3- 
(2, 2 , 2- trif luoroacetamido) -prop-l-ynyl] -2 ' - 
deoxycytidine (13) . 

20 The starting material (8) (lOg, 20.43 mmol) was 

azeotroped in dry pyridine (2 x 100 ml) then dissolved 
in dry pyridine (160 ml) under N 2 atmosphere. 
Chlorotrimethylsilane (10 ml, 79.07 mmol) added drop 
wise to the solution and stirred for 2 hours at room 

25 temperature. Benzoyl chloride (2.6 ml, 22.40 mmol) was 
then added to solution and stirred for one further 
hour. The reaction mixture was cooled to 0°C, 
distilled water (50 ml) added slowly to the solution 
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and stirred for 30 minutes. Pyridine and water were 
evaporated from mixture under high vacuum to yield a 
brown gel that was portioned between 100 ml of sat. 
aq. NaHC0 3 (100 ml) solution DCM. The organic phase 
5 was separated and the aqueous phase extracted with a 
further (2 x 100 ml) of DCM. The organic layers were 
combined, dried (MgS0 4 ) , filtered and the volatiles 
evaporated under reduced pressure. The resulting brown 
oil was purified by flash chromatography on silica gel 

10 (DCM : MeOH 99:1 to 95:5) to yield a light yellow 
crystalline solid (13) (8.92 g, 74%) . 1 X H NMR (d 6 
DMSO) : 6 0.00 (s, 6H, CH 3 ) , 0.78 (s, 9H, tBu) , 1.94 
(m # 1H, H-2'), 2.27 (m, 1H, H-2'), 3.64 (d, 1H, J = 
11.6 Hz, H-5') , 3.75 (d, 1H, J = 11.6 Hz, H-5') , 3.91 

15 (m, 1H, H-4'), 4.09 (brm, 3H, CH 2 NH, H-3'), 5.24 (s, 

1H, 3'-OH), 6.00 (m, 1H, H-l'), 7.39 (m, 2H, Ph) , 7.52 
(m, 2H, Ph) , 7.86 (m, 1H, Ph) , 8.0 (s, 1H, H-6), 9.79 
(t, 1H, J= 5.4 Hz, NHCH 2 ), 12.67 (br s, 1H, NH) . 
Mass (+ve electrospray) calcd for C 2 7H33F 3 N40 € Si 594.67, 

20 found 595. 




4 -N- Benzoyl- 5' -O- (tert-butyldimethylsilyl) -3' -O- 
25 methylthiomethyl-5- [3- (2, 2,2-trif luoroacetamido) -prop- 
1-ynyl] -2' -deoxycytidine (14) . 



The starting material (13) (2.85 g, 4.79 mmol) was 
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dissolved in dry DMSO (40 ml) under N 2 atmosphere. 
Acetic acid (2.7 ml, 4 7.9 mmol) and acetic anhydride 
(14.4 ml, 143.7 mmol) were added sequentially and 
slowly to the starting material, which was then 
5 stirred for 18 h at room temperature. Saturated NaHC0 3 
(150 ml) solution was carefully added to the reaction 
mixture. The aqueous layer was extracted with EtOAc (3 
x 150 ml) . The organic layers were combined, dried 
(MgS0 4 ) , filtered and evaporated to yield an orange 

10 liquid that was subsequently azeotroped with toluene 
(4 x 150 ml) until material solidified. Crude residue 
purified on silica gel (petroleum ether :EtOAc 3:1 to 
2:1) to yield a yellow crystalline solid (14) (1.58 g, 
50%). X K NMR (d 6 DMSO): 6 0.00 (s, 6H, CH 3 ) , 0.78 (s, 

15 9H, tBu) , 1.99 (s, 3H, CH 3 ) , 2.09 (m, 1H, H-2'), 

2.28 (m, 1H, H-2'), 3.66 (d, 1H, J* = 11.5, 2.9 Hz, 
H-5'), 3.74 (dd, 1H, J - 11.3, 2.9 Hz, H-5'), 3.99 
(m, 1H, H-4'), 4.09 (m, 1H, CH 2 NH) , 4.29 (m, 1H, H- 
3'), 4.61 (s, 2H, CH 2 S) , 6.00 (m, 1H, H-l'), 7.37 (m, 

20 2H, Ph) , 7.50 (m, 2H, Ph) , 7.80 (d, 1H, J = 7.55 Hz, 
H Ar ) , 7.97 (s, 1H, H-6) , 9.79 (br t, 1H, NHCH 2 ) , 12.64 
(br s, 1H, NH) . Mass (-ve electrospray) calcd for 
C 2 9H37F3N 4 0 6 SSi 654.79, found 653.2. 



25 




(15) 
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4 -//-Benzoyl- 5 ' -O- (tert-butyldimethylsilyl) -3' -O- 
azidoxnethyl-5- [3- (2,2, 2-trif luoroacetamido) -prop-1- 
ynyl] -2' -deoxycytidine (15) . 

5 The starting material (14) (1.65 g, 2.99 mmol) was 
dissolved in DCM (18 ml) and cooled to 0°C. 
Cyclohexene (1.5 ml, 14.95 mmol) and S0 2 C1 2 (0.72 ml, 
8.97 mmol) were added and stirred 1 h in ice bath. TLC 
indicated starting material still to be present 
10 whereupon a further aliquot of S0 2 C1 2 (0.24 ml) was 
added and the mixture stirred for 1 li at 0°C. 
Volatiles were removed by evaporation to yield a light 
brown solid that was redissolved in 18 ml of dry DMF 
(18 ml) under N 2 . Sodium azide (0.97 g, 14.95 mmol) 

15 was then added to the solution and stirred for 2.5 h 
at room temperature. The reaction mixture was passed 
through a pad of silica and eluted with EtOAc and the 
volatiles removed by high vacuum evaporation. The 
resulting brown gel was purified by flash 

20 chromatography (petroleum ether :EtOAc 4:1 to 2:1) to 

yield the desired product as a white crystalline solid 
(15) (0.9 g, 55%). X H NMR (d 6 DMSO) : 6 0.00 (s, 6H, 
CH 3 ) , 0.78 (s, 9H, tBu) , 2.16 (m, 1H, H-2'), 2.22 (m, 
1H, H-2'), 3.70 (d, 1H, J = 11.5 Hz, H-5'), 3.75 (d, 

25 1H, J= 11.3 Hz, H-5'), 4.01 (m, 1H, H-4'), 4.10 (m, 

1H, CH 2 NH) , 4.23 (m, 1H, H-3'), 4.76 (s, 2H, CH 2 S) , 
5.99 (m, 1H, H-l'), 7.37 (m, 2H, Ph) , 7.50 (m, 2H, 
Ph) , 7.81 (d, 1H, J = 7.4 Hz, Ph) , 7.95 (s, 1H # H-6) , 
9.78 (br s, 1H, NHCH 2 ) , 12.64 (br s, 1H, NH) . Mass (- 

30 ve electrospray) calcd^ for C 2 8H34F 3 N70 6 Si 649.71, found 
648.2 
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TBDMS< 




(16) 

4 -tf -Benzoyl -3' -O-azidomethyl-5- [3- (2,2,2- 
5 trif luoroacetamido) -prop-l-ynyl] -2' -deoxycytidine 
(16). 

The starting material (15) (140 mg, 0.22 mmol) was 
dissolved in THF (7.5 ml). TBAF (1M soln. in THF, 0.25 

10 ml) was added slowly and stirred for 2 h at room 

temperature. Volatile material removed under reduced 
pressure to yield a brown gel that was purified by 
flash chromatography (EtOAc : DCM 7:3) to yield the 
desired product (16) as a light coloured crystalline 

15 solid (0.9 g, 76%). X H NMR (d 6 DMSO) : 6 2 . 16 (m, 1H, H- 
2'), 2.22 <m, 1H, H-2'), 3.70 (d, 1H, J = 11.5 Hz, H- 
5'), 3.75 (d, 1H, J - = 11.3 Hz, H-5'), 4.01 (m, 1H, H- 
4'), 4.10 (m, 1H, CH 2 NH) , 4.23 (m, 1H, H-3'), 4.76 (s, 
. 2H, CH 2 S), 5.32 (s, 1H, 5' OH), 5.99 (m, 1H, H-l'), 

20 7.37 (m, 2H, Ph) , 7.50 (m, 2H, Ph) , 7.81 (d, 1H, J = 
7.35 Hz, Ph) , 7.95 (s, 1H, H-6) , 9.78 (br s, 1H, 
NJiCH 2 ) , 12.64 (br s, 1H, NH) . Mass (-ve electrospray) 
calcd for C 22 H 2 oF3N 7 0 6 53 5.44, found 534. 



25 
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FjCOCHN 




(17) 



5- (3 -Amino-prop-l-ynyl) -3' -O-azidomethyl-2 ' - 
5 deoxycytidine 5 ' -O-nucleoside triphosphate (17). 

To a solution of (11) and (12) (290 mg, 0.67 mmol) 
and proton sponge (175 mg, 0.82 mmol) (both previously 
dried under P 2 O s for at least 24 h) in PO(OMe) 3 (600 

10 at 0 °C under Argon atmosphere, was slowly added 

POCl 3 (freshly distilled) (82 0.88 mmol). The 

solution was vigorously stirred for 3 h at 0 °C and 
then quenched by addition of tetra- tributylammonium 
diphosphate (0.5 M) in DMF (5.2 ml, 2.60 mmol), 

15 followed by nBu 3 N (1.23 ml, 5.20 mmol) and 

triethylammonium bicarbonate (TEAB) 0.1 M (20 ml). 
After 1 h at room temperature aqueous ammonia solution 
(p 0.88, 2 0 ml) was added to the mixture. Solution 
stirred at room temperature for 15 h, volatiles 

20 evaporated under reduced pressure and the residue was 
purified by MPLC with a gradient of TEAB from 0.05M to 
0.7M. The expected triphosphate was eluted from the 
column at approx. 0.60 M TEAB. A second purification 
was done by HPLC in a Zorbax SB-C18 column (21.2 mm 

25 i.d. x 25 cm) eluted with 0 . 1M TEAB (pump A) and 3 0% 
CH 3 CN in 0.1M TEAB (pump B) using a gradient as 
follows: 0-5 min 5% B, 0.2 ml; 5-25 min 80% B, 0 . 8 
ml; 25-27 min 95 %B, 0 . 8 ml; 27-30 min 95 %B, <D.8 ml; 
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30-32 min 5 %B, <D.8 ml; 32-35 min 95 %B, 0,2 ml, 
affording the product described above with a r t (17): 
20.8 (14.5 nmols, 2.5% yield); 31 P NMR (D 2 0, 162 MHz) 8 
-5.59 (d, J = 20.1 Hz, P x ) , -10.25 (d, J = 19.3 Hz, 
5 IP, P a ) , -20.96 (t, J" = 19.5 Hz, IP, Pp) ; X H NMR (D 2 0) 
8 2.47-2.54 (m, 1H, H-2'), 2.20-2.27 (m, 1H, H-2'), 
3.88 (s, 2H, CH 2 N) , 4.04-4.12 (m, 1H, HH-5'), 4.16- 
4.22 (m, 1H, HH-5'), 4.24-4.30 (m, 1H, H-4'), 4.44- 
4.48 (m, 1H, H-3'), 6.13 (t, J" = 6.3 Hz, 1H, H-l'), 
10 8.35 (S, 1H, H-6); MS (ES) : m/z (%) (M-H) 574 (73%), 

494 (100 %) . 



o "NH3* T I 





X\HN 




C0 2 " HO 



♦h 2 n" ^or ^nh 2 

♦H 2 N^ ^ ^O' y ^NH 2 SO3- 

so 3 - so 3 - 

2 (HNEt 3 ) + 

2 (HNEt 3 f 



15 Alexa488 disulfide linker. 



Commercial available Alexa Fluor 488-NHS (35 mg , 54 
|imol) was dissolved in DMF (700 ^L) and, to ensure 
full activation, 4-DMAP (7 mg, 59 jimol) and N,N'- 
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disuccinimidyl carbonate (15 mg, 59 ^imol) were 
sequentially added. After 15 min on complete 
activation, a solution of the starting disulfide (32.0 
mg, 108 ^imol) in DMF (300 jiL) containing 
5 diisopropylethylamine (4 |iL) was added over the 

solution of the activated dye. Further addition of 
diisopropylethylamine (20 jaL) to the final mixture was 
done, ultrasonicated for 5 min and reacted for 18 h at 
room temperature in the darkness. The volatiles were 
10 evaporated under reduced pressure and the crude 

residue was first purified passing it through a short 
ion exchange resin Sephadex -DEAE A-25 (40-120 
column, first eluted with TEAB 0.1 M (25 ml) then 1.0 
M TEAB (75 ml ) . The latest containing the two final 
15 compounds was concentrated and the residue was HPLC 

purified in a Zorbax SB-C18 column (21.2 mm i.d. x 25 
cm) eluted with 0 . 1M TEAB (pump A) and CH 3 CN (pump B) 
using a gradient as follows: 0-2 min 2% B, <D.2 ml; 2-4 
min 2% B, <D.8 ml; 4-15 min 23 %B, <D.8 ml; 15-24 min 
20 23 %B, <D.8 ml; 24-26 min 95 %B, 0.8 ml; 26-28 min 95 
%B, <X>.8 ml, 28-30 min 2 %B, 0 . 8 ml, 30-33 min 2 %B, 
0,2 ml affording both compounds detailed above with 
t r : 19.0 (left regioisomer) and t r : 19.5 (right 
regioisomer) . Both regioisomers were respectively 
25 passed through a dowex ion exchange resin column, 

affording respectively 16.2 jimol and 10.0 ^mol, 62% 
total yield (based in commercial available Alexa 
Fluor 488-NHS of 76% purity); e 493 = 71, 000 cm" 1 NT 1 in 
H 2 0. 1 H NMR (D 2 0) (left regioisomer) 5 2.51 (t, J" = 6-8 
30 . Hz, 2H, CH 2 ) , 2.66 (t, J = 6.8 Hz, 2H, CH 2 ) , 2.71 (t , J 
= 5.8 Hz, 2H; CH 2 ) , 3.43 (t, J = 5.8 Hz, 2H, CH 2 ) , 
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6.64 (d, J = 9.2 Hz, 2H, H Ar ) , 6.77 (d, J = 9.2 Hz, 
2H, H Ar ) , 7.46 (s, 1H, H Ar ) , 7.90 (dd, J =8.1 and 1.5 
Hz, 1H, H Ar ) , 8.20 (d, J = 8.1 Hz, 1H, H Ar ) . X H NMR 
(D 2 0) (right regioisomer) 8 2.67 (t, J = 6.8 Hz, 2H, 
5 CH 2 ) , 2.82 (t, J" = 6.8 Hz, 2H, CH 2 ) , 2.93 (t, J = 6.1 
Hz, 2H, CH 2 ) , 3.68 (t, J = 6.1 Hz, 2H, CH 2 ) , 6.72 (d, 
J = 9.3 Hz, 2H, H Ar ) , 6.90 (d, J = 9.3 Hz, 2H, H Ar ) , 
7.32 (d, J =7.9 Hz, 1H, H Ar ) , 8.03 (dd, J = 7.9, 1.7 
Hz, 1H, Hat), 8.50 (d, J = 1.8 Hz, 1H, H^) Mass (-ve. 
10 electrospray) calcd for C 26 H 2 3N 3 Oi2S4 697.02, found 692 
(M-H) , 347 [M/2] . 




(18) 

15 

To a solution of Alexa Fluor 488 disulfide linker (3.4 
umol, 2.37mg) in DMF (200 uL) was added 4-DMAP (0.75 
mg, 5.1 umol) and N, W-disuccinimidyl carbonate (1.70 
mg, 5.1 umol) . The mixture was stirred for 15 to full 
20 activation of the acid, then it was added into the 
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solution of the nucleotide (17) (3.45 mg, 6.0 nmol) in 
DMF (0.3 ml) containing nBu 3 N (40 jiL) at 0 °C. The 
mixture was sonicated for 3 min and then continuously- 
stirred for 16 h in the absence of light. .The 
5 volatiles were evaporated under reduced pressure and 

the residue was firstly purified by filtration through 
a short ion exchange resin Sephadex -DEAE A- 2 5 column, 
first eluted with TEAB 0.1 M (50 ml) removing the 
unreacted dye -linker, then 1.0 M TEAB (100 ml) to 
10 collect the expected product (18) . After 

concentration and the residue was HPLC purified in a 
Zorbax SB-C18 column (21.2 mm i.d. x 25 cm) eluted 
with 0.1M TEAB (pump A) and CH 3 CN (pump B) using a 
gradient as follows: 0-2 min 2% B, <D.2 ml; 2-4 min 2% 

15 B, 0.8 ml; 4-15 min 23 %B, <D . 8 ml; 15-24 min 23 %B, 

<D.8 ml; 24-26 min 95 %B, <D.8 ml; 26-28 min 95 %B, <D.8 
ml, 28-30 min 2 %B, 0.8 ml, 30-33 min 2 %B, <D.2 ml 
affording the product detailed above with a r t (18): 
19.8 (0.26 jimols, 12% yield based on UV measurement); 

20 A^ax = 493 nm, e 71,000 cm" 1 NT 1 in H 2 0) ; 31 P NMR (D 2 0, 
162 MHz) 8 -5.06 (d, J = 20.6 Hz, IP, P x ) , -10.25 (d, 
J = 19.3 Hz, IP, P a ) , -21.21 (t, J" = 19.5 Hz, IP, P p ) ; 
X H NMR (D 2 0) 5 2.09-2.17 (m, 1H, HH-2'), 2.43-2.50 (m, 
1H, HH-2')/ 2.61 (t, J = 6.8 Hz, 2H, H 2 C-S) , 2.83 (2H, 

25 S-CH 2 ) , 3.68 (t, J= 6.0 Hz, 2H, ArCONCH 2 ) , 4.06 (s, 

2H, CH 2 N) , 4.08-4.17 (m, 4H, HH-S' ) , 4.25-4.29 (m, 1H, 
H-4'), 4.46-4.50 (m, 1H, H-3'), 6.09 (t, J = 6.4 Hz, 
1H, H-l'), 6.88 (d, J = 9.1 Hz, 1H, H^) , 6.89 (d, J - 
9.3 Hz, 1H, H Ar ) / 7.15 % (d, J = 9.3 Hz, 1H, H Ar ) , 7.17 

30 (d, J" = 9.1 Hz, 1H, H Ar ) , 7.64 (br s, 1H, H Ar ) , 8.00- 

7.94 (m, 2H, H Ar ) , 8.04 (s, 1H, H-6) ; MS (ES): m/z (%) 
(M-H) 1253 (46%), (M-H+ Na) _ 1275 (100 %) . 
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OH OH (19) 

5 7-Deaza- [3- (2, 2 , 2- trif luoroacetamido) -prop-l-ynyl] -2* - 
deoxyguanosine (19) . 

Under N 2 , a suspension of 7-deaza-7-iodo-guanosine (2 
g, 2.75 mmol) , Pd(PPh 3 ) 4 (582 mg, 0.55 mmol) , Cul (210 

10 nig, 1.1 mmol), Et 3 N (1.52 ml, 11 mmol) and the 

propagyl amine (2.5 g, 16.5 mmol) in DMF (4 0 ml) was 
stirred at room temperature for 15 h under N 2 . The 
reaction was protected from light with aluminium foil. 
After TLC indicating the full consumption of starting 

15 material, the reaction mixture was concentrated. The 

residue was diluted with MeOH (20 ml) and treated with 
dowex-HC0 3 ~. The mixture was stirring for 30 min and 
filtered. The solution was concentrated and purified 
by silica gel chromatography (petroleum ether :EtOAc 

20 50:50 to petroleum ether: EtOAc : MeOH 40: 40: 20), 

giving (19) as a yellow powder (2.1 g, 92%). 1 H NMR 
(d 6 DMSO) 5 2.07-2.11 (m, 1H, H-2'), 2.31-2.33 (m, 1H, 
H-2'), 3.49-3.53 (m, 2H, H-5'), 3.77 (brs, 1H, H- 
4'), 4.25 (d, J = 4.3 Hz, 2H, sCCH 2 ) , 4.30 (br s, 1H, 

25 H-3'), 4.95 (t, J= 5.2 Hz, 1H, 5'-OH), 5.25 (d, J = 
3.4 Hz, 1H, 3' -OH), 6.27-6.31 (m, 1H, H-l'), 6.37 (s, 
2H, NH 2 ) , 7.31 (s, 1H, H-8), 10.10 (br s, 1H, 
NHCOCF3) , 10.55 (s, 1H, NH) . Mass (-ve elect rospray) 
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calcd for C 16 Hi 6 F 3 N 5 Os 415, found 414. 




(20) 



5 

5' -O- (tert-Butyldiphenyl) -7-deaza-7- [3- (2,2,2- 
trif luoroacetamido) -prop- 1-ynyl] -2' - deoxyguano s ine 
(20) . 

10 A solution of (19) (2.4 g, 5.8 mmol) in pyridine (50 
ml) was treated with tert-butyldiphenylsilyl chloride 
(TBDPSC1) (1.65 ml, 6.3 mmol) drop wise at 0 °C. The 
reaction mixture was then warmed to room temperature. 
After 4h, another portion of TBDPSC1 (260 nL, 1 mmol) 

15 was added. The reaction was monitored by TLC, until 
full consumption of the starting material. The 
reaction was quenched with MeOH (-5 ml) and evaporated 
to dryness. The residue was dissolved in DCM and aq. 
sat. NaHC0 3 was added. The aqueous layer was extracted 

20 with DCM three times. The combined organic extracts 
were dried (MgS0 4 ) and concentrated under vacuum. 
Purification by chromatography on silica (EtOAc to 
EtOAc :MeOH 85:15) gave (20) a yellow foam (3.1 g, 
82%). X H NMR (d 6 DMSO) 5 1.07 (s , 9H, CH 3 ) , 2.19-2.23 

25 (m, 1H, H-2'), 2.38-2^.43 (m, 1H, H-2'), 3.73-3.93 

(m, 2H, H-5'), 4.29 (d, J = 5.0 Hz, 2H, CH 2 N) , 4.42- 
4.43 (m, 1H, H-3'), 5.41 (brs, 1H, OH), 6.37 (t, J= 
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6.5 Hz, H-l'), 6,45 (br s, 2H, NH 2 )' , 7.24-7.71 (m, 
11H, H-8, Hftr) , 10.12 (t, J = 3.6 Hz, 1H, NH) , 10.62 
(s, 1H, H-3) . Mass (+ve electrospray) calcd for 
C 32 H 34 F3N s 0 5 Si 653, found 654. 

5 




5' -O- (tert-Butyldiphenyl) -7-deaza-3' -O- 
methylthiolmethyl-7- [3- (2 , 2 , 2- trif luoroacetamido) - 
10 prop-l-ynyl] -2 ' - deoxyguano s ine (21) . 

A solution of (20) (1.97 g, 3.0 mmol) in DMSO (15 ml) 
was treated with Ac 2 0 (8.5 ml, 90 mmol), and AcOH (2.4 
ml, 42 mmol) and stirred at room temperature for 15 h, 

15 then 2 h at 40°C. The reaction mixture was diluted 

with EtOAc (200 ml) and stirred with sat, aq. NaHC0 3 
(2 00 ml) for 1 h. The aqueous layer was washed with 
EtOAc twice. The organic layer was combined, dried 
(MgS0 4 ) and concentrated under vacuum. Purification by 

20 chromatography on silica (EtOAc :Hexane 1:1 to 

EtOAc : Hexane : MeOH 10:10:1) gave (21) as a yellow foam 
(1.3 g, 60%). X H NMR (CDCI3) 5 1.04 (s, 9H, CH 3 ) , 2.08 
(s, 3H, SCH 3 ) , 2.19-2.35 (m, 2H, H-2), 3.67-3.71 (m, 
2H, H-5'), 3.97-3.99 (m, 2H, H-4', H-3'), 4.23 (br s, 

25 2H, CH 2 N) , 4.58 (s, 2H/ CH 2 S) , 6.31 (dd, J = 5.7, 7.9 

Hz, H-l'), 7.19-7.62 (m, 11H, H8 , H Ar ) . Mass (+ve 
electrospray) calcd for C 3 4H 3 8F3N 5 05SSi 713, found: 714. 
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(22) 



3' -0-Azidomethyl-7-deaza-7- [3- (2,2,2 - 
trifluoroacetamido) -prop-l-ynyl] -2' -deoxyguanosine 
5 (22). 

To a solution of (21) (1.3 mg, 1.8 mmol) , cyclohexene 
(0.91 ml, 9 mmol) in CH 2 C1 2 (10 ml) in 4 °C, 
sulfurylchloride (1M in CH 2 C1 2 ) (1.1 ml, 1.1 mmol) was 
10 added drop wise under N 2 . After 30 min. , TLC indicated 
the full consumption of the nucleoside (22) . After 
evaporation to remove the solvent, the residue was 
then subjected to high vacuum for 20 min, and then 
treated with NaN 3 (585 mmol, 9 mmol) and DMF (10 ml) . 

15 The resulted suspension was stirred under room 

temperature for 2h. Extraction with CH 2 Cl 2 /NaCl (10%) 
gave a yellow gum, which was treated with TBAF in THF 
(1 M, 3 ml) and THF (3 ml) at room temperature for 20 
min. Evaporation to remove solvents, extraction with 

20 EtOAc/sat. aq. NaHC0 3/ followed by purification by 
chromatography on silica (EtOAc to EtOAc :MeOH 9:1) 
gave (22) as a yellow foam (420 mg, 50%) . X H NMR (d 6 
DMSO) : 8 2.36-2.42 (m, 1H, H-2'), 2.49-2.55 (m, 1H, H- 
2'), 3.57-3.59 (m, 2H, H-5'), 3.97-4.00 (m, 1H, H-4'), 

25 4.29 (m, 2H, CH 2 N) , 4.46-4.48 (m, 1H, H-3'), 4.92-4.96 
(m, 2H, CH 2 N 3 ), 5.14 (t, J = 5.4 Hz, 1H, 5'-OH), 5.96- 
6.00 (dd, J = 5.7, 8.7 Hz, 1H, H-l'), 6.46 (br s, 2H, 
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3' -0-Azidomethyl-7-deaza-7- [3- (2,2,2- 

trif luoroacetamido) -prop- 1-ynyl] -2' -deoxy guano sine 5' - 
O -nucleoside triphosphate (23) . 

10 

Tetrasodium diphosphate decahydrate (1.5 g, 3.4 mmol) 
was dissolved in water (34 ml) and the solution was 
applied to a column of dowex 50 in the H + form. The 
column was washed with water. The eluent dropped 

15 directly into a cooled (ice bath) and stirred solution 
of tri-n-butyl amine (1.6 ml, 6.8 mmol) in EtOH (14 
ml) . The column was washed until the pH of the eluent 
increased to 6. The aqueous ethanol solution was 
evaporated to dryness and then co-evaporated twice 

20 with ethanol and twice with anhydrous DMF. The residue 
was dissolved in DMF (6.7 ml). The pale yellow 
solution was stored over 4A molecular sieves. The 
nucleoside (22) and proton sponge was dried over P 2 0 5 
under vacuum overnight. A solution of (22) (104 mg, 

25 0.2 2 mmol) and proton sponge (71 mg, 0.33 mmol) in 
trimethylphosphate (0.4 ml) was stirred with 4A 
molecular sieves for 1 h. Freshly distilled P0C1 3 (25 
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0.26 mmol) was added and the solution was stirred 
at 4°C for 2h. The mixture was slowly warmed up to 
room temperature and bis (tri-n-butyl ammonium) 
pyrophosphate (1.76 ml, 0.88 mmol) and anhydrous tri- 
5 n-butyl amine (0.42 ml, 1.76 mmol) were added. After 5 
min, the reaction was quenched with 0.1 M TEAB 
(trie thy 1 ammonium bicarbonate) buffer (15 ml) and 
stirred for 3 h. The water was removed under reduced 
pressure and the resulting residue dissolved in 
10 concentrated ammonia (p 0.88, 10 ml) and stirred at 
room temperature for 16 h. The reaction mixture was 
then evaporated to dryness. The residue was dissolved 
in water and the solution applied to a DEAE-Sephadex 
A-25 column. MPLC was performed with a linear gradient 
15 of 2 L each of 0.0 5 M and 1 M TEAB. The triphosphate 
was eluted between 0.7 M and 0.8 M buffer. Fractions 
containing the product were combined and evaporated to 
dryness. The residue was dissolved in water and 
further purified by HPLC. t r (2 3) = 2 0.5 min (Zorbax 
20 C18 preparative column, gradient: 5% to 35% B in 30 
min, buffer A 0 . 1M TEAB, buffer B MeCN) . The product 
was isolated as a white foam (225 O.D., 29.6 jxmol, 
13.4%, e 260 = 7,600). X H NMR (D 2 0) 8 2.43-2.5 (m, 2H, H- 
2'), 3.85 (m, 2H, CH 2 N) , 3.97-4.07 (m, 2H, H-5'), 4.25 
25 (br s, 1H, H-4'), 4.57 (br s, 1H, H-3'), 4.74-4.78 (m, 

2H, CH 2 N 3 ) , 6.26-6.29 (m, 1H, H-l'), 7.41 (s, 1H, H- 
8). 31 P-NMR (D 2 0) 8 -8.6 (m, IP, P y ) , -10.1 (d, ^7=19.4 
Hz, IP, Pa), -21.8 (t, J = 19.4 Hz, IP, P p ) . Mass (-ve 
electrospray) calcd for C 15 H 21 N 8 0 13 P3 614, found 613. 



30 
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scv 




S0 3 H 



O^N 3 
(24) 



A mixture of disulphide Iinkered-Cy3 (2.5 jimol) , 1- (3- 
5 dimethylaminopropyl) -3-ethylcarbodiimide hydrochloride 
(EDO (0.95 mg, 5 |amol) t 1-hydroxybenzotriazole (HOBt) 
(0.68 mg, 5 ^imol) and N-methyl -morpholine (0.55 |iL, 5 
jimol) in DMF (0.9 ml) was stirred at room temperature 
for 1 h. A solution of (23) (44 O.D., 3.75 |umol) in 
10 0.1 ml water was added to the reaction mixture at 4°C / 
and left at room temperature for 3 h. The reaction was 
quenched with TEAB buffer (0.1M, 10 ml) and loaded on 
a DEAE Sephadex column (2x5 cm) . The column was 
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first eluted with 0.1 M TEAB buffer (100 ml) and then 
1 M TEAB buffer (100 ml) . The desired triphosphate 
product was eluted out with 1 M TEAB buffer. 
Concentrating the fraction containing the product and 
5 applied to HPLC. t r (24) =23.8 min (Zorbax C18 

preparative column, gradient: 5% to 55% B in 30 min, 
buffer A 0.1M TEAB, buffer B MeCN) . The product was 
isolated as a red foam (0.5 umol , 20%, e max = 150,000). 
X H NMR (D 2 0) 6 1.17-1.71 (m, 20H, 4 x CH 2 , 4 x CH 3 ) , .. 

10 2.07-2.15 (m, 1H, H-2'), 2.21-2.30 (m, 1H, H-2'), 

2.52-2.58 (m, 2H, CH 2 ) , 2.66-2.68 (m, 2H, CH 2 ) , 2.72- 
2.76 (m, 2H, CH 2 ) , 3.08-3.19 (m, 2H, CH 2 ) , 3.81-3.93 
(m, 6H, CH 2 , H-5'), 4.08-4.16 (m, 1H, H-4'), 4.45-4.47 
(m, 1H, H-3'), 4.70-4.79 (m, 2H, CH 2 N 3 ) , 6.05-6.08 (m, 

15 2H, H Ar ) , 6.15-6.18 (m, 1H, H-l'), 7.11 (s, 1H, H-8) , 
7.09-7.18 (m, 2H, CH) , 7.63-7.72 (m, 4H, H Ar ) , 8.27- 
8.29 (m, 1H, CH) . 31 P NMR (D 2 0) 8 -4.7 (m, IP, P y ) , - 
9.8 (m, IP, P„) , -19.7 (m, IP, P p ) . Mass (-ve 
electrospray) calcd for C 5 iH 66 N 11 0 2 iP 3 S4l389 . 25 , found 
20 1388 (M-H) , 694 [M-2H] , 462 [M-3H] . 




7-Deaza-7- [3- (2, 2, 2-trif luoroacetamido) -prop- 1 -ynyl] - 
25 2' -deoxyadenosine (25). 

To a suspension of 7-deaza-7-iodo-2 ' -deoxyadenosine (1 
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g, 2.65 mmol) and Cul (100 mg, 0.53 mmol) in dry DMF 
(20 ml) was added triethylamine (740 pi, 5.3 mmol). 
After stirring for 5 min trif luoro-iV-prop-2 -ynyl - 
acetamide (1.2 g, 7.95 mmol) and Pd(PPh 3 )4 (308 mg, 
5 0.26 mmol) were added to the mixture and the reaction 
was stirred at room temperature in the dark for 16 h. 
MeOH (40 ml) and bicarbonate dowex was added to the 
reaction mixture and stirred for 45 min. The mixture 
was filtered. The filtrate washed with MeOH and the. . 

10 solvent was removed under vacuum. The crude mixture 
was purified by chromatography on silica (EtOAc to 
EtOAc : MeOH 95:20) to give slightly yellow powder (25) 
(1.0 g, 95 %) . l H NMR (d 6 DMSO) 6 2.11-2.19 (m, 1H, H- 
2'), 2.40-2.46 (m, 1H, H-2 ' ) , 3.44-3.58 (m, 2H, H-5'), 

15 3.80 (m, 1H, H-4'), 4.29 (m, 3H, H-3', CH 2 N) , 5.07 (t, 
J = 5.5 Hz, 1H, OH) , 5.26 (d, J = 4 . 0 Hz, 1H, OH) , 
6.45 (dd, J = 6.1, 8.1 Hz, 1H, H-l'), 7.74 (s, 1H, H- 
8) , 8.09 (s, 1H, H-2) , 10.09 (t, J = 5.3 Hz, 1H, NH) . 



20 




5' -O- (tert-Butyldiphenylsilyl) -7-deaza-7- [3- (2,2,2- 
trif luoroacetamido) -prop-l-ynyl] -2' -deoxyadenosine 
(26) . 

The nucleoside (25) (1.13 g, 2.82 mmol) was 
coevaporated twice in dry pyridine (2 x 10ml) and 
dissolved in dry pyridine (18 ml) . To this solution 
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was added t-butyldiphenylsilylchloride (748 
2.87 mmol) in small portions at 0°C. The reaction 
mixture was let to warm up at room temperature and 
left stirring overnight. The reaction was quenched 
5 with sat. aq. NaCl solution. EtOAc (25 ml) was added 
to reaction mixture and the aqueous layer was 
extracted with EtOAc three times. After drying the 
combined organic extracts (MgS0 4 ) the solvent was 
removed under vacuum. Purification by chromatography 

10 on silica (DCM then EtOAc to EtOAc : MeOH 85:15) gave 
(26) as a slightly yellow powder (1.76 g, 97 %) . 1 H 
NMR (d 6 DMSO) 6 1.03 (s, 9H, tBu) , 2.25-2.32 (m, 1H, 
H-2'), 2.06-2.47 (m, 1H, H-2'), 3.71-3.90 (m, 2H, H- 
5'), 3.90-3.96 (m, 1H, H-4'), 4.32 (m, 2H, CH 2 N) , 4.46 

15 (m, 1H, H-3'), 5.42 (br s, 1H, OH), 6.53 (t, 

J m 6.1 Hz, 1H, H-l'), 7.38-7.64 (m, 11H, H-8 and 
H Ar ) , 8.16 (s, 1H, H-2), 10.12 (t, J = 5.3 Hz, 1H, 
NH) . 




(27) 



5' -O- (tert-Butyldiphenylsilyl) -7 -deaza-4-^, N- 
dimethylformadin-7- [3- (2, 2, 2-trif luoroacetamido) -prop- 
25 1-ynyl] -2 9 -deoxy adenosine (27). A solution of the 

nucleoside (26) (831 mg, 1.30 mmol) was dissolved in a 
mixture of MeOH : N, N-dimethylacetal (30 ml: 3ml) and 
stirred at 4 0°C. The reaction monitored by TLC, was 
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complete after 1 h. The solvent was removed under 
vacuum. Purification by chromatography on silica 
(EtOAc : MeOH 95:5) gave (27) as a slightly brown 
powder (777 mg, 86 %) . X H NMR (d 6 DMSO) 6 0.99 (s, 9H, 
5 tBu) , 2.22-2.29 (m, 1H, H-2'), 2.50-2.59 (m, 1H, H- 
2'), 3.13 (s. 3H, CH 3 ) , 3.18 (s. 3H, CH 3 ) , 3.68-3.87 
(m, 2H, H-5'), 3.88-3.92 (m, 1H, H-4'), 4.25 (m f 2H, 
CH 2 N) , 4.43 (m, 1H, H-3'), 6.56 (t, J = 6.6 Hz, 1H, H- 
1'), 7.36-7.65 (m, 10H, H Ar ) , 7.71 (s, 1H, H-8) , 8.33 
10 (s, 1H, CH) , 8.8 (s, 1H, H-2), 10.12 (t, cJ= 5.3 Hz, 

1H, NH) . 




(28) 



15 5' -O- (terfc-Butyldiphenylsilyl) -7 -deaza-4 -^JV- 
dimethylf ormadin- 3 ' -O-methylthioxnethoxy-7- [3- (2,2,2 - 
trif luoroacetamido) -prop-l-ynyl] -2' -deoxyadenosine 
(28). 

20 To a solution of (27) (623 mg, 0.89 mmol) in dry DMSO 
(8 ml) was added acetic acid (775 fil , 13.35 mmol) and 
acetic anhydride (2.54ml, 26.7 mmol). The mixture was 
stirred overnight at room temperature. The reaction 
was then poured into EtOAc and sat. aq. NaHC0 3 (1:1) 

25 solution and stirred vigorously. The organic layer was 
washed one more time with sat. aq. NaHC0 3 and dried 
over MgS0 4 . After removing the solvent under reduced 
pressure, the product (28) was purified by 
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chromatography on silica (EtOAc : petroleum ether 1:2, 
then EtOAc) yielding (28) (350 mg, 52 %) . *H NMR (d 6 
DMSO) : 6, 1.0 (S, 9H, tBu) , 2.09 (s, 3H, SCH 3 ) , 2.41- 
2.48 (m, 1H, H-2'), 2.64-2.72 (m, 1H, H-2'), 3.12 (s, 
5 3H, CH 3 ) , 3.17 (s, 3H, CH 3 ) , 3.66-3.89 (m, 2H, H-5'), 
4.04 (m, 1H, H-4'), 4.26 (m, J = 5.6 Hz, 2H, CH 2 ) , 
4.67 (m, 1H, H-3'), 4.74 (brs, 2H, CH 2 ) , 6.49 (t, J = 
6.1, 8.1Hz, 1H, H-l'), 7.37-7.48 (m, 5H, H^) , 7.58- 
7.67 (m, 5H, H Ar ) , 7.76 (s, 1H, H-8) , 8.30 (s, 1H, 
10 CH) , 8.79 (s, 1H, H-2), 10.05 (t, J" = 5.6 Hz, 1H, NH) . 



3' -O-Azidomethyl-5' -O- (tejrt-butyldiphenylsilyl) -7- 
15 deaza-4-N,N-dimethylformadin-7- [3- (2,2,2- 

trif luoroacetamido) -prop-l-ynyl] -2' -deoxyadenosine 
(29) . 

To a solution of (28) (200 mg, 0.26 mmol) and 
20 cyclohexene (0.13 5 ml, 1.3 mmol) in dry CH 2 C1 2 (5 ml) 
at 0 °C, sulfurylchoride (32 jil, 0.3 9 mmol) was added 
under N 2 . After 10 min, TLC indicated the full 
consumption of the nucleoside (28) . The solvent was 
evaporated and the residue was subjected to high 
25 vacuum for 2 0 min. It was then redissolved in dry DMF 
(3 ml) , cooled to 0°C and treated with NaN 3 (86 mg, 
1.3 mmol). The resulting suspension was stirred under 
room temperature for 3h. The reaction was partitioned 
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between EtOAc and water. The aqueous phases were 
extracted with EtOAc. The combined organic extracts 
were combined and dried over MgS0 4 . After removing the 
solvent under reduced pressure, the mixture was 
5 purified by chromatography on silica (EtOAc) yielding 
an oil (29) (155 mg, 80 %) . X H NMR (d 6 DMSO) : 5 0.99 
(s, 9H, tBu) , 2.45-2.50 (m, 1H, H-2'), 2.69-2.78 (m, 
1H, H-2'), 3.12 (s, 3H, CH 3 ) , 3.17 (s, 3H, CH 3 ) , 3.67- 
3.88 (m, 2H, H-5'), 4.06 (m, 1H, H-4'), 4.25 (m, 2H, 
10 CH 2 ) , 4.61 (m, 1H, H-3'), 4.84-4.97 (m, 2H, CH 2 ) , 6.58 
(t, J = 6.6 Hz, 1H, H-l'), 7.35-7.47 % (m, 5H, H Ar ) , 
7.58-7.65 (m, 5H, Hat), 7.77 (s, 1H, H-8) , 8.30 (s, 
1H, CH) , 8.79 (s # 1H, H-2), 10.05 (br s, 1H, NH) . 




(30) 



3' -0-Azidomethyl-7-deaza-4-tf,J/-dimethylformadin-7- [3- 
(2, 2, 2-trif luoroacetamido) -prop-l-ynyl] -2 # - 
deoxyadenosine (30) . 

20 

A solution of (29) (155 mg, 0.207 mmol) in solution in 
tetrahydrofuran (THF) (3 ml) was treated with TBAF (1 
M in THF, 228 \xl) at 0°C. The ice-bath was then 
removed and the reaction mixture stirred at room 
25 temperature. After 2 h-TLC indicated the full 
consumption of the nucleoside. The solvent was 
removed. . Purification by chromatography on silica 
(EtOAc :MeOH 95:5) gave (30) (86 mg, 82 %) as a pale 
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brown oil. X H NMR (d 6 DMSO) 6 2.40-2.48 (dd, J = 8.1 , 
13.6 Hz, 1H, H-2'), 2.59-2.68 (dd, J = 8.3, 14 Hz, 1H, 
H-2'), 3.12 (s, 3H, CH 3 ) , 3.17 (s, 3H, CH 3 ) , 3.52-3.62 
(m, 2H, H-5'), 4.02 (m, 1H, H-4'), 4.28 (d, J = 5.6 
5 Hz, 2H, CH 2 NH) , 4.47 (m, 1H, H-3'), 4.89 (s , 2H, 

CH 2 N 3 ) , 5.19 (t, J= 5.6 Hz, 1H, OH), 6.49 (dd, J = 
8.1, 8.7 HZ, 1H, H-l'), 7.88 (s, 1H, H-8), 8.34 (s, 
1H, CH) , 8.80 (s, 1H, H-2), 10.08 (s, 1H, NH) . 




7- (3-Axninoprop-l-ynyl) - 3' -O-azidomethyl -7-deaza-2' - 
deoxyadenosine 5 ' -O-nucleoside triphosphate (31). 

15 The nucleoside (30) and proton sponge was dried over 
P 2 O s under vacuum overnight. A solution of (30) 
(150 mg, 0.2 94 mmol) and proton sponge (126 mg, 
0.588 mmol) in trimethylphosphate (980 ^1) was stirred 
with 4A molecular sieves for 1 h. Freshly distilled 

20 P0C1 3 (3 6 jil, 0.3 88 mmol) was added and the solution 
was stirred at 4°C for 2h. The mixture was slowly 
warmed up to room temperature and bis (tri-n-butyl 
ammonium) pyrophosphate 0.5 M solution in DMF (2.35 
ml, 1.17 mmol) and anhydrous tri-n-butyl amine (560 

25 pi, 2.35 mmol) was added. After 5 min, the reaction 
was quenched with 0.1 M TEAB ( triethylammonium 
bicarbonate) buffer (15 ml) and stirred for 3h. The 
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water was removed unde-r reduced pressure and the 
resulting residue dissolved in concentrated ammonia (p 
0.88, 15 ml) and stirred at room temperature for 16 h. 
The reaction mixture was then evaporated to dryness. 
5 The residue was dissolved in water and the solution 
applied to a DEAE-Sephadex A- 2 5 column. MPLC was 
performed with a linear gradient of 0.05 M to 1 M 
TEAB. Fractions containing the product were combined 
and evaporated to dryness. The residue was dissolved. 

10 in water and further purified by HPLC. HPLC: t r (31): 

19.94 min (Zorbax C18 preparative column, gradient: 5% 
to 35% B in 20 min, buffer A 0.1M TEAB, buffer B 
MeCN) . The product (31) was isolated as a white foam 
(17.5 nmol, 5.9%, e 280 = 15000). X H NMR (D 2 0) 5 2.67- 

15 2.84 (2m, 2H, H-2'), 4.14 (m, 2H, CH 2 NH) , 4.17-4.36 

(m, 2H, H-5' ) , 4 .52 (br s, H-4' ) , 6.73 (t, J = 6.6 Hz, 
H-l'), 8.06 (s, 1H, H-8), 8.19 (s, 1H, H-2). 31 P NMR 
(D 2 0) 5 -5.07 (d, J = 21.8 Hz, IP, P Y ) , -10-19 (d, J 
= 19.8 Hz, IP, P a ), -21.32 (t, J*= 19.8 Hz, IP, P p ) . 

20 Mass (-ve electrospray) calcd for Ci 5 H 2 iN 8 Oi 2 P 3 598.05, 
found 596. 
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(32) 



To the Cy3 disulphide linker (1.3 ^mol) in solution in 
5 DMF (450 fil) is added at 0°C 50 nl of a mixture of 1- 
(3-dimethylaminopropyl) -3 -ethylcarbodiimide 
hydrochloride, 1 -hydroxybenzotriazole hydrate and N- 
methylmorpholine (26 jiM each) in DMF. The reaction 
mixture wais stirred at room temperature for 1 h. The 
10 reaction was monitored by TLC (MeOH:CH 2 Cl 2 3:7) until 
all the dye linker was consumed. Then DMF (4 00 |il) 
was added at 0°C, followed by the nucleotide (31) (1.2 
(amol) in solution in water (100 |il) and the reaction 
mixture and stirred at room temperature overnight. TLC 
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(MeOH:CH 2 Cl 2 4:6) showed complete consumption of the 
activated ester and a dark red spot appeared on the 
baseline. The reaction was quenched with TEAB buffer 

(0.1M, 10 ml) and loaded on a DEAE Sephadex column (2 
5 x 5 cm) - The column was first eluted with 0.1 M TEAB 
buffer (100 ml) to wash off organic residues and then 
1 M TEAB buffer (100 ml) . The desired triphosphate 

(32) was eluted out with 1 M TEAB buffer. The fraction 
containing the product were combined, evaporated and 
10 purified by HPLC. HPLC conditions: t r (32): 22.44 min 

(Zorbax C18 preparative column, gradient: 5% to 35% B 
in 20 min, buffer A 0 . 1M TEAB, buffer B MeCN) . The 
product was isolated as dark pink solid (0.15 jamol , 
12.5%, 6sso = 150000). l H NMR (D 2 0) 8 2.03 (t, 2H, CH 2 ) , 
15 2.25 (m, 1H, H-2'), 2.43 (m, 1H, H-2'), 2.50 (m, 2H, 

CH 2 ) , 2.66 (m, 2H, CH 2 ) , 3.79 (m, 2H CH 2 ) , 3.99 (m, 4H, 
CH 2 N, H-5'), 4.18 (br s, 1H, H-4'), 6.02, 6.17 (2d, J 
= 13.64 Hz, 2H, H Ar ) , 6.30 (dd, J = 6.06, 8.58 Hz, H- 
1'), 7.08, 7.22 (2d, 2H, 2 x =CH) , 7.58-7.82 (m, 5H, 
20 H Ar , H-2, H-8), 8.29 (m, =CH) . n P NMR (D 2 0) 5 -4.83 (m, 
IP, P Y ) , -10.06 (m, IP, P a ), -20.72 (m, IP, P 3 ) . 

Enzyme Incorporation of 3 ' -Azidomethyl dNTPs 

To a 100 nM DNA primer/template (primer previously 
25 labelled with P32 and T4 polynucleotide kinase) in 

Tris-HCl pH 8.8 50 mM, Tween-20 0.01%, and MgS0 4 4 mM, 
add 2 |±M compound 6 and 100 nM polymerase 
(Thermococcus sp. 9°N exo "Y4 0 9V A485L supplied by New 
England Biolabs) . The template consists of a run of 
30 10 adenine bases to show the effect of the block. The 
reaction is heated to 65 C for 10 mins. To show 
complete blocking, a chase is performed with the four 
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native, unblocked nucleoside triphosphates. 
Quantitative incorporation of a single azidomethyl 
blocked dTTP can be observed and thus the azidomethyl 
group can be seen to act as an effective block to 
5 further incorporation. 

By attaching a hairpin DNA (covalently attached self 
complementary primer/ template) to a streptavidin bead 
The reaction can be performed over multiple cycles as 
10 shown in figures 5 and 6. 

Preparation of the streptavidin beads 

Remove the storage buffer and wash the beads 3 times 
with TE buffer (Tris-HCl pH 8, 10 mM and EDTA, 1 mM) . 

15 Resuspend in B & W buffer (10 mM Tris-HCl pH 7.5, 1 mM 
EDTA and 2.0 M NaCl) , add biotinylated 32 P labelled 
hairpin DNA with appropriate overhanging template 
sequence. Allow to stand at room temperature for 15 
minutes. Remove buffer and wash beads 3 times TE 

20 buffer. 

Incorporation of the fully functional nucleoside 
triphosphate (FFN) 

25 To a solution of Tris-HCl pH 8.8 50 mM, Tween-20 

0.01%, MgS0 4 4 mM, MnCl 2 0.4 mM (except cycle 1, 0.2 
mM) , add 2 yM FFN and 100 nM polymerase. This solution 
is then added to the beads and mixed thoroughly and 
incubated at 65°C for 10-15 minutes. The reaction 

30 mixture is removed and the beads washed 3 times with 
TE buffer. 
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Deblocking step 

Tris- (2-carboxyethyl) phosphines trisodium salt (TCEP) 
(0.1M) is added to the beads and mixed thoroughly. 
The mixture was then incubated at 65°C for 15 minutes. 
5 The deblocking solution is removed and the beads 
washed 3 times with TE buffer. 

Capping step 

Iodoacetamide (431 mM) in 0.1 idM phosphate pH 6.5 is. 
10 added to the beads and mixed thoroughly, this is then 
left at room temperature for 5 minutes . The capping 
solution is removed and the beads washed 3 times with 
TE buffer. 

15 Repeat as required 

The reaction products can be analysed by placing the 
bead solution in the well of a standard 12% 
polyacrylamide DNA sequencing gel in 4 0% formamide 

20 loading buffer. Running the gel under denaturing 
conditions causes the DNA to be released from the 
beads and onto the gel. The DNA band shifts are 
affected by both the presence of dye and the addition 
of extra nucleotides and thus the cleavage of the dye 

25 (and block) with the phosphine cause a mobility shift 
on the gel . 

Two cycles of incorporation with compounds 18 (C) , 24 
(G) and 32 (A) and six cycles with compound 6 can be 
30 seen in figures Fig. 5^and Fig. 6. 

3»-OH protected with an allyl group: 
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Nucleotides bearing this blocking group at the 
3'position have been synthesised, shown to be 
5 successfully incorporated by DNA polymerases, block 
efficiently and may be subsequently removed under 
neutral, aqueous conditions using water soluble 
phosphines or thiols allowing further extension. 




5' -O- (t-Butyldimethylsilyl) -5-iodo-2' -deoxyuridine 
(33) . 

15 To a solution of 5-iodo-2 ' -deoxyuridine (5.0 g, 

14 mmol) in 70 ml in dry N, N-dimethylf ormamide (DMF) 
was added imidazole (1.09 g, 16 mmol), followed by 
(2,41 g, 16 mmol) TBDMSC1 at 0 °C. The mixture was 
left in the ice bath and stirred overnight. The 

20 reaction was quenched with sat. aq. NaCl solution and 
extracted with EtOAc. After drying (MgS0 4 ) , the 
solvent was removed and the crude mixture was purified 
by chromatography on silica (EtOAc : petroleum ether 
3:7). The product (33) (5.9 g, 90 %) was obtained as a 

25 colourless solid. X H NMR (d € DMSO) 8 0.00 (s, 3H, CH 3 ) , 
0.79 (s, 9H, tBu) , 1.88-1.97 (m, 1H, H-2'), 2.00-2.05 
(m, 1H, H-2'), 3.59-3.71 (m, 2H, H-5'), 3.75 (brs, 
1H, H-4' ) , 4 . 06 (br s, 1H, H-3' ) , 5.18 (d, J= 4.0 Hz, 
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1H, OH), 5.98 (t, J= 5.9Hz,lH, H-l'), 7.89 (s, 1H # 
H-6) , 11.62 (s, 1H, NH) . Mass (-ve electrospray) calcd 
for Ci 5 H2sIN 2 OsSi 468.06 found 467. 



5 




3' -O-Allyl-5' -0-t-butyldimethylsilyl-5-±odo-2' - 
deoxyuridine (34) • 

10 To a suspension of NaH (4 97 mg, 12.4 mmol , 60 % in 

mineral oil) in dry THF (20 ml) a solution of 5'-TBDMS 
protected 5 -iodo- 2 ' -deoxyuridine (2.8 g, 5.9 mmol) in 
dry THF (50 ml) was added drop wise. After the gas 
evolution had stopped the mixture was stirred for 

15 another 10 min and then allylbromide (561 fil , 

6.5 mmol) was added drop wise. After the complete 
addition the milky reaction mixture was stirred at 
room temperature for 16 h. The reaction was quenched 
by addition of sat. aq. NaCl solution (30 ml). The 

20 aqueous layer was extracted three times using EtOAc 
and after washing with sat. aq. NaCl solution the 
organic phase was dried (MgS0 4 ) . After removing of the 
solvents the crude product was purified by 
chromatography (EtOAc : petroleum ether 1:1). The 

25 allylated product (2.39 g, 80 %) was obtained as a 

colourless foam. X H NMR (d 6 DMSO) S -0.01 (s, 3H, CH 3 ) , 
0.78 (s, 9H, tBu) , 1.94-2.01 (m, 1H, H-2'), 2.16-2.21 
(m, 1H, H-2'), 3.61-3.71 (m, 2H, H-5'), 3.87-3.94 (m, 
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4H, H-3', H-4', OCH 2 ) , 5.04 (dd, J= 1.6, 10.4 Hz, 1H, 
=CH 2 ) , 5.15 (dd, J = 1.8/ 17.3 Hz, 1H, =CH 2 ) , 5.72- 
5.81 (m, 1H, CH=) , 5.92 (t, J = 5.7 Hz, 1H, H-l'), 
7.88 (s, 1H, 6-H), 11.6 (s, 1H, NH) . Mass (-ve 
5 electrospray) calcd for Ci 8 H 2 9lN20 5 Si 508.09, found 507. 



TBDMSi 




3' -0-Allyl-5-iodo-2' -deoxyuridine (35) . 

10 

To a solution of (34) (2.34 g, 4.71 mmol) in dry THF 
(40 ml) was added at 0 °C TBAF (5.2 ml, 5.2 mmol, 1 M 
solution in THF) . The reaction mixture was allowed to 
warm up to room temperature and was then stirred for 

15 16 h. The reaction was quenched by adding sat. NaCl 

solution (20 ml) and extracted with EtOAc three times. 
The combined organic layers were dried over MgS0 4 . 
The crude mixture was purified by chromatography on 
silica (EtOAc:petrol 7:3). Product (35) (1.4 g, 75 %) 

20 was isolated as a colourless solid. X H NMR (d 6 DMSO) 8 
2.02-2.39 (m, 2H, H-2'), 3.42-3.52 (m, 2H, H-5'), 
3.84-3.88 (m, 3H, H-4', CH 2 ] , 3.97-4.00 (m, 1H, H-3'), 
5.02-5.09 (m, 2H, OH, =CH 2 ) , (dd, J = 1.9, 17.3 Hz, 
1H, =CH 2 ) , 5.73-5.82 (m, 1H, CH=) , 5.94 (t, J" = 

25 6.8 Hz, 1H, H-l'), 8.24 (s, 1H, H-6) , 11.56 (s, 1H, 

NH) . Mass (-ve electrospray) calcd for C 12 Hi 6 IN 2 0 5 
394 . 0 . found 393 . 
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3' -O-Allyl-5- [3- (2, 2, 2-trif luoroacetamido) -prop-1- 
ynyl] -2' -deoxyuridine. 

5 

To a solution of (35) (400 mg, l.o mmol) in dry DMF 
(10 ml) was added Cul (38 mg, 20 ^imol) and 
triethylamine (300 y.1 , 2.0 mmol). The 
propargyltrif luoroacet amide (453 mg, 3.0 mmol) was 
10 added drop wise, followed by Pd(PPh 3 ) 4 (110 mg, 

9.5 fimol) . The reaction was stirred for 16 h in the 
dark. The reaction was quenched by adding MeOH 
(10 ml) , DCM (10 ml) and bicarbonate dowex. The 
mixture was stirred for 30 min and then filtered. The 

15 solvents were removed under vacuum and the crude 
product was purified by chromatography on silica 
(EtOAc : petrol 3 : 7 to 7 : 3) . The product was 
isolated as slightly yellow crystals (398 mg, 95 %) . 
1 H NMR (d 6 DMSO) 8 2.25-2.43 (m, 2H, H-2'), 3.65-3.76 

20 (m, 2H, H-5'), 4.07-4.17 (m, 3H, H-4', CH 2 ) , 4.21-4.23 

(m, 1H, H-3'), 4.34 (d, J = 5.5 Hz, 2H, CH 2 N) , 5.25- 
5.27 (m, 2H, =CH 2 , OH), 5.38 (dd, J = 1.83, 17.3 Hz, 
1H, =CH 2 ) , 5.96-6.06 (m, 1H, =CH) , 6.17 (t, J - 
6.9Hz, 1H, H-l'), 8.29 (s, 1H, H-6) , 10.17 (t, J = 

25 5.5Hz, 1H, NHTFA) , 11.78 (s, 1H, NH) . Mass (-ve 

electrospray) calcd for C 17 H 18 F 3 N306 417.11, found 416. 
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3 ' -O-Allyl-5- [3- (2 , 2 , 2 - trif luoroacetamido) -prop-1- 
ynyl] -2 ' -deoxyuridine 5 ' -O-nucleoeide triphosphate 
5 (37) . 

Under nitrogen (36) (100 mg, 0.24 mmol) and proton 
sponge (61.5 mg, 0.28 mmol), both dried under vacuum 
over P 2 0 5 for 24 h, were dissolved in OP(OMe) 3 

10 (225 jxl). At 0°C freshly distilled POCl 3 was added drop 
wise and the mixture was stirred for 1.5 h. Then 
pyrophosphate (1.44 ml, 0.72 ^imol, 0.5 M in DMF) and 
nBu 3 N (0.36 ml, 1.5 mmol) were added and the resulting 
mixture stirred for another 1.5 h. Triethyl ammonium 

15 bicarbonate solution (4.5 ml, 0.1 M solution, TEAB) 
was added and the reaction mixture was left stirring 
for 2 h. Then aq. NH 3 (4.5 ml) was added and the 
mixture was stirred for 16 h. After removing the 
solvents to dryness, the residue was redissolved in 

20 water, filtered and purified by MPLC, followed by HPLC 
purification. The desired triphosphate (37) (10.2 
jimol, 4 %, e 2 ao = 10000) was isolated as a colourless 
foam. MPLC conditions: a gradient was run from 0.05M 
TEAB to 0.7 M TEAB using 2 1 of each on a DEAE 

25 sephadex column. The product containing fractions came 
off with -0.4 M TEAB. After removing the solvent, the 
product was HPLC purified. HPLC conditions: 
t r (triphosphate) : 21.9 min (Zorbax C-18 preparative 
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10 



column, buffer A 0.1 M TEAB , buffer B 0.1 M TEAB + 
30 % Ace tonit rile, gradient 5 - 35 % buffer B in 
35 min) . 1 H NMR (D 2 0) 8 2.17-2.23 (m, 1H, H-2'), 2.40- 
2.45 (m, 1H, H-2'), 3.67 (s, 2H, CH 2 N) , 3.99 (d, 
J = 5.9 Hz, 2H, OCH 2 ) , 4.02-4.17 (m, 2H, H-5'), 4.25 
(br S, 1H, H-4'), 4.32-4.33 (m, 1H, H-3'), 5.13 (d, 
J = 10.3 Hz, 1H, =CH 2 ) , 5.23 (d, J =17.2 Hz, 1H, 
=CH 2 ) , 5.78-5.88 (m. 1H, =CH) , 6.16 (t, J = 6.7 Hz, 
1H, H-l'), 8.33 (s, 1H, H-6) . 31 P NMR (161.9 MHz, D 2 Q) 
5 -21.3 (t, J" = 19.5 Hz, IP, P y ) , -10.3 (d, J =19 Hz, 
IP, P a ), -7.1 <d, J = 15.5 Hz, IP, P P ) . Mass (-ve 
electrospray) calcd for C 15 H 22 N30 14 P3 561.03, found 560, 
480 [M-phosphate] , 401 [M-2x phosphate] . 



15 




so 3 h 



SO3H 



(38) 



To a solut ion of Cy3 disulfide linker (2.5 fimol) in 
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DMF (0.2 ml) at 0 °C was added. Disuccinimidyl 
carbonate (0.96 mg 3.75 |xmol) and 4- (dimethylamino) 
pyridine (DMAP) (0.46 mg 3.75 (imol) . The reaction 
mixture was stirred for 10 min and then checked by TLC 
5 (MeOH : DCM 3:7) (activated ester r f = 0.5). In a 

separate flask the 3'-0-allyl thymidine triphosphate 
(37) (532 pi, 14.1 mM in water, 7.5 fimol) were mixed 
with Bu 3 N (14 3 |il) and evaporated to dryness. After 
this the triphosphate (37) was dissolved in dry DMF 
10 (0.2 ml). To the triphosphate (37) solution at 0 °C 
was added the activated dye and the reaction mixture 
was allowed to warm to room temperature and then 
stirred for 16 h. The solvent was removed and the 
residue was dissolved in water. The reaction mixture 
15 was passed through a small DEAE sephadex column (2x5 
cm) using 0.1 M TEAS (100 ml) to remove the coupling 
reagents and unreacted linker. With 1 M TEAB (100 ml) 
the triphosphate (38) was eluted. The mixture was then 
separated by HPLC. Yield: 1.41 ^imol (56 %, e 550 = 
20 150000) product as a dark red solid were isolated. 
HPLC conditions: t r (38): 19.6 min (Zorbax C-18 
preparative column, buffer A 0.1 M TEAB, buffer B 
Acetonitrile, gradient: 2-58 % buffer B in 29 min). X H 
(d 6 DMSO) 5 0.75-0.79 (m, 3H, CH 3 ) , 1.17-1.28 (m, 2H, 
25 CH 2 ) , 1.48-1.55 (m, 2H, CH 2 ) , 1.64 (s, 12H, 4 xCH 3 ), 

1.70-1.77 (m, 2H, CH 2 ) , 1.96-2.02 (m, 1H, H-2'), 2.07- 
2.11 (m, 2H, CH 2 ) , 2.25-2.30 (m, 1H, H-2'), 2.51-2.55 
(m, 2H, CH 2 ) , 2.64-2.68 (m, 2H, CH 2 ) , 2.75-2.81 (m, 
2H, CH 2 ) , 3.27-3.31 <m,^2H, CH 2 ) , 3.91-4.05 (m, 9H, H- 
30 5', OCH 2/ NCH 2 , 2 x NCH 2 -dye) , 4.13 (s, 1H, H-4'), 
4.22-4.24 (m, 1H, H-3'), 5.06 (d, J = 10.5Hz, 1H, 
=CH 2 ) , 5.15 (dd, J = 1.4 Hz, 17.3 Hz, 1H, =CH 2 ) , 5.72- 
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5.82 (m, 1H, =CH) , 6.03-6.06 (m, 1H, H-l'), 6.20-6.29 
(m, 2H, aH) , 7.23- 7.31 (m, 2H, H Ar ) , 7.63-7.79 (m, 

5H, H-6, 4x H Ar ) , 8.31-8.45 (m, 1H, 0H) . 31 P 
(161.9 MHz, d 6 DMSO) 5 -20.2 (m, IP, P p ) , -10.0 (d, 
5 J 18.5 Hz, IP, P a ) , -4.8 (d, J* 19.5 Hz, IP, P y ) . 

Mass <-ve electrospray) calcd for C S1 H67S 4 N 6 022P3 

1336.24, found 1335.1, 688.1 [cleaved disul fide (dye), 

647.9 [cleaved disulfide (nucleotide)]. 

10 Enzyme Incorporation of compound 38 

To a 100 nM DNA primer/template (primer previously 
labelled with P32 and T4 polynucleotide kinase) in 
Tris-HCl pH 8.8 50 mM, Tween-20 0.01%, and MgS0 4 4 mM, 
add 2 nM compound 3 8 and 100 nM polymerase 

15 (Thermococcus sp. 9°N exo "Y409V A485L supplied by New 

England Biolabs) . The template consists of a run of 
10 adenine bases to show the effect of the block. The 
reaction is heated to 65 C for 10 mins. To show 
complete blocking, a chase is performed with the four 

20 native, unblocked nucleoside triphosphates. 

Quantitative incorporation of the allyl block can be 
observed (see Fig. 7) and this can be seen to act as 
an effective block to further incorporation. 




5 # -O- (tert-Butyldimethylsilyl) -S-iodo-2' - 
deoxycytidine (39) . 
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To a solution of 5-iodo-2 ' -deoxycytidine (2.2 g, 6.23 
mmol) in DMF (130 ml) was added imidazole (467 mg, 
6.85 mmol) . The mixture was cooled at 0 °C and tert- 
5 butyldimethylsilyl chloride (TBDMSCl) (1.33 g, 6.85 
mmol) added over 5 minutes. After 18 h at room 
temperature, the volatiles were evaporated under 
reduced pressure and the residue purified by flash 
chromatography on silica gel with EtOAc : MeOH (95:5 to 

10 90:10) to give the expected product (39) (2.10 g, 

72%) together with unreacted starting material (490 
mg) . X H NMR (d 6 DMSO) 8 0.11 (s, 3H, CH 3 ) , 0.12 (s, 3H, 
CH 3 ) , 0.89 (s, 9H, 3CH 3 ) , 1.90 (ddd, J = 13. 2, 7.7 and 
5.7 Hz, 1H, Htf-2'), 2.18 (ddd, J = 13.2, 5.7 and 2.3 

15 Hz, 1H, flH-2'), 3.72 (dd, J* = 11.5, 3.6 Hz, 1H, HH- 
5'), 3.80 (dd, J = 11.5, 2.8 Hz, 1H, HH-5'), 3.86- 
3.89 (m, 1H, H-4'), 4.14-4.18 (m, 1H, H-3'), 5.22 
(1H, d, J = 4.1 Hz, OH), 6.09 (1H, dd, J - 7.8, 5.8 
Hz, H-l'), 6.60 (br s, 1H, NHtf) , 7.81 (br s, 1H, 

20 NflH) , 7.94 (s, 1H, H-6); MS (ES) : m/z (%) (M+H) 468 
(90%) . 



TBDMS* 




25 3' -O-Allyl-5' -O- (tert -butyldimethylsilyl) -5-iodo-2' - 
deoxycytidine (40) . 

To a solution of NaH (60 %, 113 mg, 2.84 mmol) in THF 
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(26 ml) under N 2 atmosphere, was slowly added a 
solution of the starting nucleoside (39) (669 mg, 1.43 
mmol) in THF (6 ml) . The mixture was stirred at room 
temperature for 45 minutes, cooled at 0 °C and allyl 
5 bromide (134 jaL, 1.58 mmol) was slowly added. After 15 
h at room temperature, the solution was cooled to 0 °C 
and quenched by addition of H 2 0 (5 ml) . THF evaporated 
under reduced pressure and the product extracted into 
EtOAc (3 x 25 ml). Combined organic extracts were 

10 dried (MgSOJ filtered and the volatiles evaporated 
under reduced pressure to give a residue that was 
purified by flash chromatography on silica gel with 
EtOAc affording the expected 3'-0-allyl product (40) 
(323 mg, 44%) as a colourless oil, together with some 

15 unreacted starting material (170 mg) ; X H NMR (d 6 DMSO) 
5 0.00 (s, 3H, CH 3 ), 0.01 (s, 3H, CH 3 ) , 0.79 (s, 9H, 
3CH 3 ) , 1.84 (ddd, J = 13.3, 8.2 and 5.5 Hz, 1H, H-2'), 
2.20-2.25 (m, 1H, H-2'), 3.62-3.72 (m, 2H, H-5'), 
3.88-3.93 (m, 4H, H-3',4', HHC-CH=) , 5.1 (dd, J* = 8.5, 

20 1.7 Hz, 1H, CH=CHH) , 5.16 (dd, J* = 17.2, 1.7 Hz, 1H, 

CH=CHH) , 5.75-5.83 (m, 1H, Ctf=CHH) , 5.94 (dd, J = 8.4, 
5.6 Hz, 1H, H-l'), 6.53 (br s, 1H, NHH) , 7.74 (br s, 
1H, Niffl) , 7.83 (s, 1H, H-6); MS (ES) : m/z (%) (M-H) 
506 (100%) . 

25 



TBDMS 




3' -0-Allyl-5-iodo-2' -deoxycytidine (41) . 
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To a solution of the starting nucleoside (40) (323 mg, 
0.64 mmol) in THF (15 ml) under N 2 protected 
atmosphere was added at room temperature 
5 tetrabutylammonium fluoride (TBAF) 1M in THF (0.7 ml, 
0.7 mmol) . Mixture stirred for one hour and then 
quenched by addition of H 2 0 (5 ml) . THF was evaporated 
and aqueous residue extracted into EtOAc (3 x 25 ml) . 
Combined organic extracts were dried (MgS0 4 ) , filtered 

10 and the volatiles evaporated under reduced pressure 
giving a crude material which was purified by flash 
chromatography on a pre-packed silica column eluted 
with EtOAc. The product (41) was obtained as a white 
solid (233 mg, 93%). X H NMR (d 6 DMSO) 8 1.96-2.05 (m, 

15 1H, H-2') 2.24 (ddd, J = 13.5, 5.8 and 2.8 Hz, 1H, H- 
2'), 3.50-3.62 (m, 2H, H5'), 3.91-3.97 (m, 2H, 
H3',H4'), 4.03-4.07 (m, 2H, JiHC-CH=) , 5.11-5.16 (m, 
2H, OH, CH=CHH) , 5.24 (dd, J = 17.2, 1.6 Hz, 1H, 
CH=CWH) , 5.82-5.91 (m, 1H, CH=CHH) , 6.02 (dd, oT=7.6, 

20 6.0 Hz, 1H, H-l'), 6.60 (s, 1H, NHH) , 7.79 (s, 1H, 

NflH) , 8.21 (s, 1H, H-6) . MS (ES) : m/z (%) (M-H) 392 
(100%) . 




3' -O-Allyl-5- [3- (2 , 2, 2-trif luoroacetamide) -prop-1- 
ynyl] -2' -deoxycytidine (42) . 
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To a solution of the starting nucleoside (41) (200 mg, 
0.51 mmol) in dry DMF (8.5 ml) at room temperature and 
Argon atmosphere, was slowly added Cul (19 mg, 0.10 
mmol), NEt 3 (148 nl , 1.02 mmol), 2 , 2 , 2-trif luoro-N- 
5 prop-2-ynyl-acetamide (230 mg, 1.53 mmol) and 

Pd(PPh 3 ) 4 (58 mg, 0.05 mmol). The mixture was stirred 
at room temperature and protected from light during 
four hours, quenched by addition of dowex bicarbonate 
and stirred for a 1 h, then filtered and the volatiles 

10 evaporated under reduced pressure. The residue was 

further evaporated from MeOH (15 ml) and then purified 
by flash chromatography on silica gel (CH 2 C1 2 , 
CH 2 Cl 2 :EtOAc 1:1, EtOAc : MeOH 97.5:2.5). The expected 
product (42) was obtained as a beige solid (180 mg, 

15 85%). X H NMR (d 6 DMSO) 5 1.90 (ddd, J = 13.6, 7.7 and 
6.0 Hz, 1H, H-2'), 2.16 (ddd, J = 13.6, 5.7 and 2.4 
Hz, 1H, H-2'), 3.42-3.50 (m, 2H, H-5'), 3.84-3.87 (m, 
3H, H-4', OHHC-CH=) , 3.94-3.96 (m, 1H, H-3'), 4.16 (d, 
J m 5.1 Hz, 2H, H 2 C-N) , 4.98-5.05 (m, 2H, OH, CH=CHH) , 

20 5.14 (dd, J = 17.3, 1.7 Hz, 1H, CH=CflH) , 5.72-5.82 (m, 
1H, Ctf=CHH) , 5.95 (dd, J = 7.7, 5.8 Hz, 1H, H-l'), 
6.74 (br s, 1H, NHH) , 7.72 (br s, 1H, NHH) , 8.01 (1H, 
s, H-6), 9.82 (br t, 1H, HN-CH 2 ) . MS (ES) : m/z (%) 
(M-H) 415 (100%) . 

25 




3' -O-Allyl-5- (3 -amino -prop- 1-ynyl) - 5' -O- triphosphate- 
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2' -deoxycytidine (43) . 

To a solution of the nucleoside (42) (170 mg, 0.41 
mmol) and proton sponge (10 5 mg, 0.50 mmol) (both 
5 previously dried under P 2 0 s for at least 24 h) in 

PO(0Me) 3 (360 jil ) , at 0 °C under Argon atmosphere, was 
slowly added P0C1 3 (freshly distilled) (50 |il, 0.54 
mmol) . The solution was vigorously stirred for 3 h at 
0 °C and then quenched by addition of tetra- 

10 tributylammonium diphosphate 0 . 5 M in DMF (3.20 ml, 

1.60 mmol), followed by nBu 3 N (0.75 ml, 3.2 mmol) and 
triethylammonium bicarbonate (TEAB) 0.1 M (12 ml). The 
mixture was stirred at room temperature for 3 h and 
then an aqueous ammonia solution (p 0.88 1.0 ml) (12 

15 ml) was added. The solution was stirred at room 
temperature for 15 h, volatiles evaporated under 
reduced pressure and the residue was purified by MPLC 
with a gradient of TEAB from 0.0 5M to 0.7M. The 
expected triphosphate (43) was eluted from the column 

20 at approx. 0.51 M TEAB. A second purification was done 
by HPLC in a Zorbax SB-C18 column (21.2 mm i.d. x 25 
cm) eluted with 0 . 1M TEAB (pump A) and 30% CH 3 CN in 
0.1M TEAB (pump B) using a gradient as follows: 0-5 
min 5% B, 0.2 ml; 5-25 min 80% B, 0.8 ml; 25-27 min 

25 95 %B, <J>.8 ml; 27-30 min 95 %B, 0.8 ml; 30-32 min 5 
%B, <D.8 ml; 32-35 min 95 %B, 0 . 2 ml, affording the 
product (43) detailed above with a t r (43): 20.5 (20 
Hmols, 5% yield); 31 P NMR (D 2 0) 5 -6.01 (d, J = 19.9 
H2, IP, P T ) , -10.24 (d, J =19.3 Hz, IP, P a ) , -21.00 (t, 

30 J =19.6 Hz, IP, P P ) ; X H NMR (D 2 0) 8 2.19-2.26 (m, 1H, 

H-2'), 2.51 (1H, ddd, J = 14.2, 6.1 and 3.2 Hz, H-2'), 
3.96-4.07 (m, 4H, NCH 2 , OHHC-CH=) , 4.09-4.14 (m, 1H, 
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1H, H-5') 4.22-4.26 (m, 1H, H-5'), 4.30-4.37 (m, 2H, 
H-3', 4'), 5.20 (d, J = 10.4 Hz, 1H, CH=CHH) , 5.30 
(1H, dd, J = 17.3, 1.5 Hz, CH=CHH) , 5.85-5.95 (m, 1H, 
CH=CHH) , 6.18 (t, iJ = 6.5 Hz, 1H, H-l'), 8.40 (s, 1H, 
5 H-6) ; MS (ES) : m/z (%) (M-H) 559 (100%). 




(44) 

10 To a solution of Alexa Fluor 488 disulfide linker 
(2.37mg, 3.4 fimol) in DMF (500 |il) was added N,N- 
disuccinimidyl carbonate (1.3 mg, 5.1 jimol) and 4-DMAP 
(0.6 mg, 5.1 jimol) . The mixture was stirred for 10 
minutes, then it was added into the solution of the 

15 nucleotide (43) (3.23 mg, 5.8 fimol) in DMF (100 

containing nBu 3 N (30 |il) . The mixture was continuously 
stirred for 16h at room temperature. The volatiles 
were evaporated under reduced pressure and the residue 
was firstly purified by passing it through a short ion 
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exchange resin Sephadex -DEAE A-25 (40-120 n) - column, 
first eluted with TEAB 0.1 M (70 ml) then 1.0 M TEAB 
(100 ml) . The latest containing the expected product 
(44) was concentrated and the residue was HPLC 
5 purified in a Zorbax SB-C18 column (21.2 mm i.d. x 25 
cm) eluted with 0 . 1M TEAB (pump A) and CH 3 CN (pump B) 
using a gradient as follows: 0-2 min 2% B, <J>.2 ml; 2-4 
min 2% B, <D.8 ml; 4-15 min 23 %B, 0.8 ml; 15-24 min 
23 %B, 0.8 ml; 24-26 min 95 %B, <D.8 ml; 26-28 min 95 

10 %B, 0>.8 ml, 28-30 min 2 %B, <D . 8 ml, 30-33 min 2 %B, 
<D.2 ml affording the product detailed above with a 
r t (44) : 19.9 (0.56 umols, 17% yield based on UV 
measurement); = 493 nm, e 71,000 cm" 1 M" 1 in H 2 0) ; 

31 P NMR (D 2 0) 8 -5.07 (d, J - 22.2 Hz, IP, P x ) , -10.26 

15 (d, J = 19.4 Hz, IP, P a ), -21.09 (t, J = 19.7 Hz, IP, 

Pp); *H NMR (D 2 0) 6 2.44-2.26 (m, 2H, HH-2'), 2.50 (t, 
vJ = 6.7 Hz, 2H, CH 2 ) , 2.83 (4H, CH 2 , CH 2 ) , 3.58 (t, J = 

6.0 Hz, 2H, CH 2 ) , 4.07-3.91 (m, 6H, HH-5' , NCH 2 , OHHC- 
CH=) , 4 .16-4 . 12 (m, 1H, H-4'), 4.23-4.17 (m, 1H, H-3'), 

20 5 . 24 - 5 . 09 (m, 2H, CH=CHH, CH=CHH) , 5.84-5.74 (m, 1H, 
CH=CHH) , 5.98 (t, J= 8.1 Hz, 1H, H-l'), 6.79 (d, J" = 

9.1 Hz, 1H, H Ar ) , 6.80 (d, J" = 9.3 Hz, 1H, H Ar ) , 7.06 
(t, J = 8.8 Hz, 2H, H Ar ) , 7.55 (br s, 1H, H Ar ) , 7.90- 
7.85 (m, 2H, H Ar ) , 7.94 (s, 1H, H-6); MS (ES) : m/z (%) 

25 (M-H) ■ 1239 (27%) . 




(45) 
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5' -O- (teirt-Butyldimethylsilyl) -7-deaza-7 -iodo-2 ' - 
deoxyguanosine (45) . 

A solution of (44) (0.55 g, 1.4 mmol) in DMF (10 ml) 
5 was treated with imidazole (190 mg, 2.8 mmol) and 
TBDMSC1 (274 mg, 1.82 mmol) at r.t. for 15h. The 
reaction was quenched with Me OH (-5 ml) . The mixture 
was evaporated to dryness. Water (-300 ml) was added 
to the residue and stirred for at least 1 h to fully 

10 dissolve imidazole. Filtration gave a brown solid, 
which was dried and purified by silica gel 
chromatography (DCM to DCM: MeOH 90:10), giving (45) 
as pale yellow powder (3 94 mg , 56%) . X H NMR (d 6 DMSO) 
8 0.00, 0.01 (2s, 6H, CH 3 ), 0.82 (s, 9H, CH 3 ) , 1.99- 

15 2.05, 2.16-2.22 (2 m, 2H, H-2'), 3,58-3.66 (m, 2H, H- 

5'), 3.72-3.74 (m, 1H, H-4'), 4.18-4.19 (m, 1H, H-3'), 
5.16 (d, J = 3.0 Hz, 1H, OH) , 6.20 (dd, J* = 6.0, 8.0 
Hz, 1H, H-l'), 6.25 (br s, 2H, NH 2 ) , 7.58 (s, 1H, H- 
8), 10.37 (s, 1H, HN) . Mass (-ve electrospray) calcd 

20 for Ci 7 H 2 7lN 4 04Si 506, found 505. 




(46) 



3' -O-Allyl-5' -O- (tert-butyldimethylsilyl) -7-deaza-7- 
iodo-2' -deoxyguanosine (46) . 

25 

A solution of (45) (354 mg, 0.7 mmol) in THF (25 ml) 
was treated with NaH (42 mg, 1.75 mmol) at r.t. for 1 
h. Allyl bromide was added and the suspension was 
stirred at r.t. for 2 days. -60% of the starting 
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material (45) was converted to the product (46) . The 
reaction was quenched with sat. aq. NaCl and extracted 
with DCM three times. The combined organic layer were 
dried (MgS0 4 ) and concentrated under vacuum. The 
5 residue was treated with TBAF in THF (1 ml) and THF (1 
ml) for 3 0 min. Evaporation to remove of THF. The 
residue was dissolved in DCM and aqueous NaHC0 3 (sat.) 
was added. The aqueous layer was extracted with DCM 
three times. The combined organics was dried over 

10 MgS0 4 and concentrated under vacuum. Purification by 
chromatography on silica (EtOAc to EtOAc : MeOH 85:15) 
gave (46) as a yellow foam (101 mg, 35%) . *H NMR (d 6 
DMSO) 5 2.15-2.31 (m, 2H, H-2'), 3.41-3.45 <m, 2H, H- 
5'), 3.82-3.85 (m, 1H, H-4'), 3.93 (d, J = 2.6 Hz f 2H, 

15 OCH 2 ) , 4.04-4.06 (m, 1H, H-3'), 4.99 (t, J = 5.4 Hz, 
OH), 5.08-5.24 (m, 2H, =CH 2 ) , 5.79-5.89 (m, 1H, CH=) , 
6.15 (dd, J = 5.9, 9.1 Hz, 1H, H-l'), 6.27 (br s, 2H, 
NH 2 ), 7.07 (s, H-8), 10.39 (s, 1H, NH) . Mass (-ve 
electrospray) calcd for Ci 4 H 17 IN 4 0 4 432, found 431. 




(47) 



3' -O-Allyl-5' -O- (tert -butyldimethylsilyl) -7-deaza-7- 
[3- (2, 2, 2-trif luoroacetamido) -prop- 1-ynyl] -2' - 
25 deoxyguanosine (47) • 



Under N 2 , a suspension of (46) (104 mg, 0.24 mmol) , 
Pd(PPh 3 ) 4 (24 mg, 0.024 mmol), Cul (9.1 mg, 0.048 
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mmol), Et 3 N (66 |iL, 0.48 mmol) and CHsCCH 2 NHCOCF 3 (89 
|4L, 0.72 mmol) in DMF (2 ml) was stirred at r.t. for 
15 h. The reaction was protected from light with 
aluminium foil. After TLC indicating the full 
5 consumption of starting material, the reaction mixture 
was concentrated. The residue was diluted with MeOH 
(20 ml) and treated with dowex-HC0 3 ~. The mixture was 
stirring for 30 min and filtered. The solution was 
concentrated and purified by silica gel chromatography 

10 (petroleum ether: EtOAc 50: 50 to petroleum ether: 
EtOAc : MeOH 40: 40: 20) giving (47) as a yellow 
powder (74 mg, 70%). *H NMR (d 6 DMSO) 6 2.15-2.39 (m, 
2H, H-2'), 3.42-3.44 (m, 2H, H-5'), 3.83-3.87 (m, 1H, 
H-4'), 3.93-3.95 (m, 2H, OCH 2 ) , 4.0-4.07 (m, 1H, H- 

15 3'), 4.15 (d, J = 5.3 HZ, 2H, sCCH 2 ) , 4.91 (t, J = 5.4 

Hz, OH), 5.08-5.24 (m, 2H, =CH 2 ) , 5.80-5.89 (m, 1H, 
CH=) , 6.15 (dd, J = 5.6, 8.9 Hz, 1H, H-l'), 6.28 (br 
s, 2H, NH 2 ) , 7.24 (s, H-8) , 9.98 (t, J = 5.3 Hz, 1H, 
NH) , 10.44 (s, 1H, NH) . Mass (-ve electrospray) calcd 
20 for C 19 H 2 oF3N 5 O s 455, found 454. 




The nucleoside (47) and proton sponge was dried over 
25 P 2 O s under vacuum overnight. A solution of (47) (73 
mg, 0.16 mmol) and proton sponge (69 mg, 0.32 mmol) 
trimethylphosphate (0.5 ml) was stirred with 4A 
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molecular sieves for 1 h. Freshly distilled P0C1 3 (18 
yil, 0.19 mmol) was added and the solution was stirred 
at 4°C for 2h. The mixture was slowly warmed up to 
room temperature and bis (tri-n-butyl ammonium) 
5 pyrophosphate (1.3 ml, 0.88 mmol) and anhydrous tri-n- 
butyl amine (0.3 ml, 1.2 8 mmol) was added. After 5 
min, the reaction was quenched with 0.1 M TEAB 
(triethylammonium bicarbonate) buffer (10 ml) and 
stirred for 3h. The water was removed under reduced - 

10 pressure and the resulting residue dissolved in 

concentrated ammonia (p 0.88, 10 ml) and stirred at 
room temperature for 16 h. The reaction mixture was 
then evaporated to dryness. The residue was dissolved 
in water and the solution applied to a DEAE-Sephadex 

15 A-25 column. MPLC was performed with a linear gradient 
of 2 L each of 0.05 M and 1 M TEAB. The triphosphate 
was eluted between 0.7 M and 0.8 M buffer. Fractions 
containing the product were combined and evaporated to 
dryness. The residue was dissolved in water and 

20 further purified by HPLC. t r (48) = 2 0.3 min (Zorbax 
C18 preparative column, gradient: 5% to 35% B in 30 
min, buffer A 0.1 M TEAB, buffer B MeCN) . The product 
(48) was isolated as a white foam (147 O.D., 19.3 
^mol, 12%, e 260 = 7, 600). X H NMR (D 2 0) 8 2.38-2.46 (m, 

25 2H, H-2'), 3.91 (m, 2H, s=CCH 2 ) , 3.98-4.07 (m, 4H, H- 

5', 2H, OCH 2 ), 4.25 (br s, 1H, H-4'), 4.40 (br s, 1H, 
H-3'), 5.16-5.30 (m, 1H, -CH 2 ) , 5.83-5.91 (m, 1H, 
=CH) , 6.23-6.27 (m, 1H, H-l'), 7.44 (s, 1H, H-8). 31 P 
NMR 8-7.1 (d, J = 16.5 Hz, IP, P y ) , -10.1 (d, J =19.9 

30 Hz, IP, P a ), -21.5 (t, J = 18.0 Hz, IP, P 3 ) . Mass (-ve 
electrospray) calcd for Ci 7 H 24 N 5 Oi 3 P3 599, found 598. 
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7-Deaza-5' -O-diphenylsilyl-7 -iodo-2 ' -deoxy adenosine 
(49) . 

5 

TBDPSC1 (0.87 g, 2.78 mmol) was added to a stirred 
solution of 7-deaza-7-iodo-2 ' -deoxyadenosine (1.05 g, 
2.78 mmol) in dry pyridine (19 ml) at 5°C under N 2 . 
After 10 min the solution was allowed to rise to room 

10 temperature and stirred for 18h. The solution was 
evaporated under reduced pressure and the residue 
purified by flash chromatography on silica (DCM to 
DCM : MeOH 19:1) . This gave the desired product (4 9) 
(1.6 g, 83%). X H NMR (d 6 DMSO) 6 1.07 (s, 9H) , 2.31- 

15 2.36 (m, 1H) , 3.76-3.80 (dd, 1H, J = 11.1, 4.7Hz), 
3.88-3.92 (dd, 1H, J = 11.2, 3.9 Hz), 3.97-4.00 (m, 
1H) , 4.49-4.50 (m, 1H) , 5.83 (s, 1H) , 6.58-6.61 (t , 
1H, J = 6.7 Hz), 7.44-7.55 (m, 6H) , 7.68-7.70 (m, 5H) , 
8.28 (s, 1H) . Mass (electrospray) calcd for 

20 C 2 7H3iIN 4 03Si 614.12, found 613. 




25 7-Deaza-6-N,tf-dimethylformadine-5' -O-diphenylsilyl-7- 
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iodo-2' -deoxyadenosine (50) . 

A solution of (49) (1.6g, 2.61 mmol) in MeOH (70 ml) 
containing dime thy lformamide dimethylacetal (6,3 g, 53 
5 mmol) was heated at 45°C for 18h. The solution was 

cooled, evaporated under reduced pressure and purified 
by flash chromatography on silica gel (EtOAc to 
EtOAc : MeOH 98:2). This resulted in 1.52 g (87%) of the 
desired product (50). X H NMR (d € DMSO) 6 0.85 (s, 9H) , 

10 2.05-2.11 (m, 1H) , 3.03 (s, 3H) , 3.06 (s, 3H) , 3.53- 
3.57 (dd, 1H, J = 11.1, 4.8 Hz), 3.65-3.69 (dd, 1H, J" 
= 11.1, 4 Hz) , 3.73-3.76 (q, 1H, J = 4 Hz) , 4.26-4.28 
(m, 1H) , 5.21-5.22 (d, 1H, J - 4.3 Hz), 6.39-6.42 (t, 
1H, J m 6.8 Hz), 7.21-7.32 (m, 6H) , 7.46 (s, 1H) , 

15 7.45-7.48 (m, 4H) , 8.15 (s, 1H) , 8.68 (s, 1H) . Mass 

(+ve electrospray) calcd for C 3 oH3 6 IN50 3 Si 669.16, found 
670. 




3 ' - O - Al ly 1 - 7 - deaz a - 6 -tf , tf- dime thy 1 f o rmadine -5' -O- 
diphenylsilyl-7 -iodo-2' -deoxyadenosine (51) . 



A solution of (50) (1.52 g, 2.28 mmol) in dry THF (5 
25 ml) was added drop wise at room temperature to a 

stirred suspension of sodium hydride (60%, 109 mg, 
2.73 mmol) in dry THF (35 ml) . After 45 min the yellow 
solution was cooled to 5°C and allyl bromide (0.413 g # 
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3.41 mmol) added. The solution was allowed to rise to 
room temperature and stirred for 18h. After adding 
isopropanol (10 drops) the solution was partitioned 
between water (5 ml) and EtOAc (50 ml) . The organic 
5 layer was separated and the aqueous solution extracted 
further with EtOAc (2x50 ml) . The combined organic 
solutions were dried (MgS0 4 ) and evaporated under 
reduced pressure. The residue was purified by flash 
chromatography on silica (petroleum ether: EtOAc 1:3 to 

10 EtOAc) to give 1.2 g (74%) of the desired product 

(51) as a gum. X H NMR (d 6 DMSO) 6 1.03 (s, 9H) , 2.39- 
2.45 (m, 1H) , 2.60-2.67 (m, 1H) , 3.2 (s, 3H) , 3.23 (s, 
3H) , 3.70-3.74 (dd, 1H, J = 11.2, 4.6 Hz), 3.83-3.87 
(dd, 1H, J = 11, 5.4 Hz), 4.03-4.08 (m, 3H) , 4.30-4.31 

15 (m, 1H) , 5.18-5.21 (m, 1H) , 5.28-5.33 (m, 1H) , 5.89- 

5.98 (m, 1H) , 6.49-6.53 (dd, 1H, J = 8.4, 5.8 Hz), 
7.41-7.51 (m, 6H) , 7.62-7.66 (m, 5H) , 8.31 (s, 1H) , 
8.85 (s, 1H) . Mass (+ve electrospray) calcd for 
C33H 4 oIN50 3 Si 709.19, found 710. 

20 




3 ' - O- Al ly 1 -7-deaza-6 -N, N- dime thy 1 f ormadine - 7 - iodo - 2 ' - 
deoxyadenosine (52) . 

25 

A 1M solution of TBAF in THF (4.4 ml, 4.4 mmol) was 
added to a solution of (51) (1.2 g, 1.69 mmol) in THF 
(100 ml) at 5°C under N 2 . The solution was allowed to 
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rise to room temperature and stirred for 2d. The 
solution was evaporated under reduced pressure and 
purified by flash chromatography on silica (EtOAc to 
EtOAc : MeOH 97:3) . This gave 593 mg (77%) of the 
5 desired product (52). *H NMR (d 6 DMSO) 6 2.54 (m, 2H) , 
3.40 (s, 3H) , 3.44 (s, 3H) , 3.72-3.8 (m, 2H) , 4.18- 
4.21 (m, 1H) , 4.23-4.27 (m, 3H) , 4.4-4.42 (d, 1H, J = 
5.7Hz), 5.35-5.41 (m, 2H) , 5.49-5.5 (q, 1H, J = 1 . 7 
Hz) , 5.53-5.55 (q, 1H, J = 1.7Hz), 6.1-6.2 (m, 1H) , 
10 6.67-6.70 (dd, 1H, i7= 8.8, 5.5 Hz), 7.96 (s, 1H) , 

8.53 (s, 1H) , 9.06 (s, 1H) . Mass (+ve electrospray) 
calcd for C 17 H22lN 5 03 471.08, found 472. 




15 

3' -0-Allyl-7-deaza-7-iodo-2' -deoxy adenosine (53) . 

A solution of (52) (593mg, 1.3 mmol) in MeOH (20 ml) 
containing 3 5% aqueous ammonia (20 ml) was heated at 

20 50°C for 2d. After cooling the solution was 

evaporated under reduced pressure and then azeotroped 
with toluene (3 x 10 ml) . This resulted in 530mg (98%) 
of the desired product (53) as a solid. X H NMR (d 6 
DMSO) 6 2.39 (m, 1H) , 3.56-3.65 (m, 2H) , 4.03-4.05 (m, 

25 1H) , 4.09-4.11 (m, 2H) , 5.23-5.25 (d, 1H, J = 10.6 

Hz), 5.35-5.4 (d, 1H, J - 15.4 Hz), 5.95-6.05 (m, 1H) , 
6.48-6.51 (dd, 1H, J"= 8.9, 5.5 Hz), 6.6-6.95 (s, 1H) , 
7.75 (s, 1H) , 8.16 (s, 1H) . Mass (+ve electrospray) 
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calcd for Ci4H 17 IN 4 03 416.03, found 417. 




5 3' -0-Allyl-7-deaza-7- [3- (2 , 2 , 2 - trif luoroacetamide) ] .-. 
2 ' - deoxy adeno s ine (54). 

To a solution of (53) (494 mg, 1.19 mmol) in dry DMF 
(17 ml) was added sequentially copper (I) iodide (45.1 

10 mg, 0.24 mmol), N-2 , 2 , 2 - trif luoro-N-prop-2 - 

ynyl ace t amide (538 mg, 3.56 mmol), Et 3 N (240 mg, 2.38 
mmol) and Pd(Ph 3 P) 4 (137 mg, 0.12 mmol) at room 
temperature. The flask was wrapped in foil to exclude 
light and stirred under N 2 for 18h. Then MeOH (10 ml) 

15 and a small spatula of dowex bicarbonate H* form were 
added and the mixture stirred for 3 0 min. The mixture 
was filtered, evaporated under reduced pressure and 
the residue triturated with MeOH to remove palladium 
salts. The filtrate was evaporated under reduced 

20 pressure and purified by flash chromatography on 

silica (DCM to DCM : MeOH 97:3). The desired product 
(54) was obtained as brown solid (490 mg, 94%) . 1 H NMR 
(d 6 DMSO) 6 2.25-2.31 (m, 1H) , 2.98-3.04 (m, 1H) , 
3.41-3.49 (m, 2H) , 3.88-3.95 (m, 3H) , 4.10-4.12 (d, 

25 1H, J = 5.2 Hz), 4.22-4.23 (d, 2H, J m 5.3 Hz), 5.07- 

5.12 (m, 2H) , 5.19-5.24 (dd, 1H, J= 17.3, 1.9 Hz), 
5.79-5.89 (m, 1H) , 6.31-6.35 (dd, 1H, J = 8.6, 5.6 
Hz), 7.69 (s, 1H) , 8.02 (S, 1H) . Mass (-ve 
electrospray) calcd for C 19 H2oF3N 5 0 4 439.15 , found 438. 
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3' -O-Allyl-7- [3-aminoprop-l-ynyl] -7-deaza-2' - 
5 deoxyadenosine 5 ' -O-nucleoside triphosphate (55)* 

The nucleoside (54) and proton sponge was dried over 
P 2 0 5 under vacuum overnight. A solution of (54) 
(84 mg, 0.191 mmol) and proton sponge (4 9 mg, 

10 0.382 mmol) in trimethylphosphate (600 was stirred 

with 4A molecular sieves for 1 h. Freshly distilled 
P0C1 3 (36 0.38 8 mmol) was added and the solution 

was stirred at 4°C for 2h. The mixture was slowly 
warmed up to room temperature and bis (tri-n-butyl 

15 ammonium) pyrophosphate 0.5 M in solution in DMF (1.52 
ml, 0.764 mmol) and anhydrous tri-n-butyl amine (364 
jil, 1.52 mmol) was added. After 5 min, the reaction 
was quenched with 0.1 M TEAB ( triethylammonium 
bicarbonate) buffer (5 ml) and stirred for 3h. The 

20 water was removed under reduced pressure and the 

resulting residue dissolved in concentrated ammonia (p 
0.88, 5 ml) and stirred at room temperature for 16 h. 
The reaction mixture was then evaporated to dryness. 
The residue was dissolved in water and the solution 

25 applied to a DEAE-Sephadex A- 2 5 column. MPLC was 
performed with a linear gradient of 0.05 M to 1 M 
TEAB. Fractions containing the product were combined 
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and evaporated to dryness. The residue was dissolved 
in water and further purified by HPLC. HPLC: t r (55) =: 
22.60 min (Zorbax C18 preparative column, gradient: 5% 
to 3 5% B in 2 0 min, buffer A 0 . 1M TEAB , buffer B MeCN) 
5 The product was isolated as a white foam (17.5 nmol , 
5.9%, 6 2 8o = 15000). X H NMR (D 2 0) 8 2.67-2.84 (2m, 2H, 
H-2'), 4.14 (br s, 2H, CH 2 NH) , 4.17-4.36 (m, 2H, H- 
5'.), 4.52 (br s, 1H, H-4'), 6.73 (t, J= 6.6 Hz, 1H, 
H-l'), 8.06 (s, 1H, H-8) , 8.19 (s, 1H, H-2). 31 P NMR 
10 (D 2 0) 8 -5.07 (d, J a 21.8 Hz, IP, P y ) , -10.19 (d, J 

= 19.8 Hz, IP, P tt ) , -21.32 (t, J = 19.8 Hz, IP, P p ) . 
Mass (-ve electrospray) calcd for Ci 5 H 21 N 8 0 12 P 3 598.05, 
found 596 




(56) 
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To the Cy3 disulphide linker (2.6 jimol) in solution in 
DMF (450 ^1) is added at 0°C 100 ^il of a mixture of 1- 
( 3 -dimethyl ami nopropyl ) - 3 -ethylcarbodiimide 
5 hydrochloride, 1 -hydroxybenzotriazole hydrate and N- 
methylmorpholine (26 jiM each) in DMF, The reaction 
mixture was stirred at room temperature for 1 h. The 
reaction was monitored by TLC (MeOH:CH 2 Cl 2 4:6) until 
all the dye linker was consumed. Then 400 |il of DMF* 

10 are added at 0°C, followed by the nucleotide (55) (3.9 
jamol) , in solution in water (100 |il) and the reaction 
mixture and stirred at room temperature overnight. TL»C 
(MeOH:CH 2 Cl 2 4:6) showed complete consumption of the 
activated ester and a dark red spot appeared on the 

15 baseline. The reaction was quenched with TEAB buffer 
(0.1M, 10 ml) and loaded on a DEAE Sephadex column (2 
x 5 cm) . The column was first eluted with 0.1 M TEAB 
buffer (100 ml) to wash off organic residues and then 
1 M TEAB buffer (100 ml) . The desired triphosphate 

20 (56) was eluted out with 1 M TEAB buffer. The fraction 
containing the product were combined, evaporated and 
purified by HPLC . HPLC conditions: t r (56)=: 21.38 min 
(Zorbax C18 preparative column, gradient: 5% to 15% B 
in lmin, then 4 min at 15% B, then 15 to 35% B in 15 

25 min, buffer A 0 . 1M TEAB, buffer B MeCN) . The product 

was isolated as dark pink solid (0 .15 |imol, 12. 5% , t S so 
= 15000). l H NMR (D 2 0) 5 2.03 (t, J = 6.4 Hz, 2H, CH 2 ) , 
2.21-2.33 (m, 1H, H-2'), 2.37-2.49 (m, 1H, H-2'), 2.50 
(t, J = 6.3 Hz, 2H, CH*) , 2.66 (t, J = 5.4 Hz, 2H, 

30 CH 2 ) , 3.79 (t, J = 6.4 Hz, 2H CH 2 ) , 3.99 (m, 4H, CH 2 N, 
H-5'), 4.18 (br s, 1H, H-4'), 6.02, 6.17 (2d, J= 13.6 
Hz, 2H, H ar ) , 6.30 (dd, J = 6.1, 8.6 Hz, H-l'), 7.08, 
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7.22 (2d, J = 7.8, 8.6 Hz, 2H, 2 X =CH) , 7.5S-7.82 (m, 
6H, 2H Ar/ H-2, H-8), 8.29 (t, J" = 13.6 Hz,, =CH) . 31 P 
NMR (D 2 0) 8 -4.83 (m, IP, P r ) , -10.06 (m, IP, P a ) , - 
20.72 (m, IP, P p ) . 

5 

Cleavage of 3'-Allyl Group in aqueous conditions 

The following shows a typical deblocking procedure for 
a 3 'blocked nucleoside in which approximately 0.5 
equivalents of Na 2 PdCl 4 and 4 equivalents of the 
10 water-soluble phosphine ligand L were employed, in 
water, at 50°C. Tfa stands for trif luoracetyl : 




15 To a solution of Ligand L (7.8 mg, 13.7|imol) in 

degassed H 2 0 (225 \xl) was added a solution of Na 2 PdCl 4 
(0.5 mg, 1.6^imol) in degassed H 2 0 (25 \xl) in an 
eppendorff vial. The two solutions were mixed well and 
after 5 min a solution of B (1 mg, 2.3^imol) in H 2 0 

20 (250 |il) was added. The reaction mixture was then 

placed in a heating block at 50 °C. The reaction could 
be followed by HPLC. Aliquot s of 50 \il were taken from 
the reaction mixture and filtered through an 
eppendorff filter vial (porosity 0 . 2jim) ; 22 \il of 

25 the solution were injected in the HPLC to monitor the 
reaction. The reaction was purified by HPLC. In a 
typical experiment the cleavage was finished (i.e. 
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>98% cleavage had occurred after 30 min) . 

3" -OH protected with a 3,4 dime thoxybenzyloxyme thy 1 
group as a protected form of a hemiacetal: 

OMe 




OMe 



Nucleotides bearing this blocking group have similar 
properties to the allyl example, though incorporate 
less rapidly. Deblocking can be achieved efficiently 
10 by the use of aqueous buffered cerium ammonium nitrate 
or DDQ, both conditions initially liberating the 
hemiacetal (1) which decomposes to the required (2) 
prior to further extension: 



O^OH * OH 

15 (1) (2) 

The 3 1 -OH may also be protected with benzyl groups 
where the phenyl group is unsubstituted, e.g. with 
benzyloxymethyl , as well as benzyl groups where the 

20 phenyl group bears electron-donating substituents; an 
example of such an electron- rich benzyl ic protecting 
group is 3,4 -dimethoxybenzyloxymethyl . 

In contrast, electron-poor benzylic protecting 
groups, such as those in which the phenyl ring is 

25 substituted with one or more nitro groups, are less 
preferred since the conditions required to form the 
intermediate groups of formulae -C(R / ) 2 -OH, -C(R') 2 - 
NH 2 , and -C(R') 2 -SH are sufficiently harsh that the 
integrity of the polynucleotide can be affected by the 
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conditions needed to deprotect such electron-poor 
benzylic protecting groups. 

3"-0H protected with a f luoromethyloxymethyl group as 
5 a protected form of a hemiacetal: 

-0-CH 2 -F 

Nucleotides bearing this blocking group may be 
converted to the intermediate hemiacetal using 
10 catalytic reactions known to those skilled in the art 
such as, for example, those using heavy metal ions 
such as silver. 



WO 2004/018497 



PCT/GB2003/003686 



- 117 - 

CLAIMS 

1. A modified nucleotide or nucleoside molecule 
comprising a purine or pyrimidine base and a ribose or 
deoxyribose sugar moiety having a removable 3 1 -OH 
5 blocking group covalently attached thereto, such that 
the 3 ' carbon atom has attached a group of the 
structure 

-0-Z 

10 wherein Z is any of -C (R' ) 2 -0-R" , -C (R' ) 2 -N (R" ) 2 , - 

C(R') 2 -N(H)R", -C(R') 2 -S-R" and -C(R'j 2 -F, 

wherein each R" is or is part of a removable 
protecting group; 

each R' is independently a hydrogen atom, an 
15 alkyl, substituted alkyl, arylalkyl, alkenyl, alkynyl, 
aryl, heteroaryl, heterocyclic, acyl , cyano, alkoxy, 
aryloxy, heteroaryloxy or amido group, or a detectable 
label attached through a linking group; or (R') 2 
represents an alkylidene group of formula =C(R''') 2 
20 wherein each R' ' * may be the same or different and is 
selected from the group comprising hydrogen and 
halogen atoms and alkyl groups; and 

wherein said molecule may be reacted to yield an 
intermediate in which each R" is exchanged for H or, 
25 where Z is -C(R') 2 -F, the F is exchanged for OH, SH or 
NH 2 , preferably OH, which intermediate dissociates 
under aqueous conditions to afford a molecule with a 
free 3 'OH; 

with the proviso that where Z is -C (R' ) 2 -S-R" , 
30 both R' groups are not H. 

2. A molecule according to claim 1 wherein R' is an 
alkyl or substituted alkyl. 
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3 . A molecule according to claim 1 or claim 2 
wherein -Z is of formula -C(R ! )-N 3 . 

4 . A molecule according to any one of claims 1 to 3 
5 wherein Z is an azidomethyl group. 

5. A molecule according to claim 1 or claim 2 
wherein R" is a benzyl or substituted benzyl group. 

10 6 . A molecule according to any preceding claim 

wherein said base is linked to a detectable label via 
a cleavable linker or a non-cleavable linker. 

7. A molecule according to claim 6 wherein said 
15 linker is cleavable. 

8 . A molecule according to any one of claims 1 to 5 
wherein a detectable label is linked to the molecule 
through the blocking group by a cleavable or non- 
20 cleavable linker. 

9. A molecule according to any one of claims 6 to 8 
wherein said detectable label is a fluorophore. 

25 10. A molecule according to any one of claims 6 to 9 
wherein said linker is acid labile, photolabile or 
contains a disulfide linkage. 

11. A modified nucleotide molecule as claimed in any 
30 one of claims 1 to 10 w hich comprises one or more 32 P 

atoms in its phosphate portion. 

12. A nucleoside, nucleotide or polynucleotide 



WO 2004/018497 



PCT/GB2003/003686 



- 119 - 

molecule of formula PN-O-allyl, wherein PN is said 
nucleoside or nucleotide or is a 3 1 terminal nucleotide 
of said polynucleotide; and said nucleoside or 
nucleotide further comprises in addition to the allyl 
5 blocking group a detectable label linked to the base 
thereof by a cleavable or non-cleavable linker. 

13. A molecule according to claim 12 wherein said 
linker is cleavable. 

10 

14. A molecule according to claim 12 or claim 13 
wherein said detectable label is a fluorophore. 

15. A molecule according to any one of claims 12 to 
15 14 wherein said linker is acid labile, photolabile or 

contains a disulfide linkage. 

16. A method of converting a compound of formula R-O- 
allyl, R 2 N(allyl), RNH(allyl), RN(allyl) 2 or R-S-allyl 

20 to a corresponding compound in which the allyl group 
is removed and replaced by hydrogen, said method 
comprising the steps of reacting a compound of formula 
R-O-allyl, R 2 N(allyl), RNH (allyl ) , RN(allyl) 2 or R-S- 
allyl in aqueous solution with a transition metal 

25 comprising a transition metal and one or more ligands 
selected from the group comprising water-soluble 
phosphine and water-soluble nitrogen-containing 
phosphine ligands, wherein the or each R is a water- 
soluble biological molecule. 

30 

17. The method of claim 16 wherein said compound is 
of formula R-O-allyl . 
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18. The method of claim* 16 or claim 17 wherein said 
R is part of a nucleoside, a nucleotide or a 
polynucleotide molecule. 

5 19. The method of claim 18 wherein said nucleoside, 
nucleotide or polynucleotide further comprises a 
detectable label linked to the base thereof by a 
cleavable or non-cleavable linker. 

10 20. A molecule according to claim 19 wherein said 
linker is cleavable. 

21. The method of claim 19 or claim 20 , wherein said 
detectable label is a fluorophore. 

15 

22. The method of any one of claims 19 to 21 wherein 
said linker is acid labile, photolabile or contains a 
di sul f ide 1 inkage . 

20 23. The method of any one of claims 19 to 22 wherein 
said allyl group and said label are removed in a 
single step. 

24. The method of any one of claims 16 to 23 wherein 
25 said transition metal is selected from the group 

comprising platinum, palladium, rhodium, ruthenium, 
osmium and iridium. 

25. The method of any one of claims 16 to 24 wherein 
30 said transition metal is palladium. 

26. The method of any one of claims 16 to 25 wherein 
said group of ligands comprise derivatised triaryl 
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phosphine ligands or derivatised trialkyl phosphine 
ligands. 

27. The method of any one of claims 16 to 26 wherein 

5 said group of ligands are derivatised with one or more 
functionalities selected from the group comprising 
amino, hydroxyl, carboxyl and sulfonate groups. 

28. The method of any one of claims 16 to 27 wherein 
10 the group of ligands comprises 3,3',3 n - 

phosphinidynetris (benzenesulf onic acid) and tris(2- 
carboxyethyl) phosphines and their salts. 

29. A method of controlling the incorporation of a 

15 nucleotide as defined in any one of claims 6 to 10 or 
as defined in any one of claims 12 to 15 and 
complementary to a second nucleotide in a target 
single-stranded polynucleotide in a synthesis or 
sequencing reaction comprising incorporating into the 

20 growing complementary polynucleotide said nucleotide, 
the incorporation of said nucleotide preventing or 
blocking introduction of subsequent nucleoside or 
nucleotide molecules into said growing complementary 
polynucleotide. 

25 

30. The method of claim 29, wherein the incorporation 
of said nucleotide is accomplished by a terminal 
transferase or polymerase or a reverse transcriptase. 

30 31. The method of claim 30 wherein the polymerase is 
a Thermococcus sp. 

32. The method of claim 31 wherein the Thermococcus 
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sp is 9°N or a single mutant or double mutant thereof. 

33. The method of claim 32 wherein the double mutant 
is -Y409V A485L. 

5 

34 . A method for determining the sequence of a target 
single-stranded polynucleotide, comprising monitoring 
the sequential incorporation of complementary 
nucleotides, wherein at least one incorporation is of 

10 a nucleotide as defined in any one of claims 6 to 10 
or as defined in any one of claims 12 to 15 and 
wherein the identity of the nucleotide incorporated is 
determined by detecting the label linked to the base, 
and the blocking group and said label are removed 

15 prior to introduction of the next complementary 
nucleotide . 

35. The method of claim 34 wherein the label of the 
nucleotide and the blocking group are removed in a 

20 single chemical treatment step. 

36. A method for determining the sequence of a target 
single -stranded polynucleotide, comprising: 

(a) providing a plurality of different 

25 nucleotides wherein said plurality of different 

nucleotides are either as defined in any one of claims 
6 to 10 or as defined in any one of claims 12 to 15 
and wherein the detectable label linked to each type 
of nucleotide can be distinguished upon detection from 

30 the detectable label used for other types of 
nucleotides; 

(b) incorporating the nucleotide into the 
complement of the target single-stranded 
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polynucleotide; 

(c) detecting the label of the nucleotide of 
(b) , thereby determining the type of nucleotide 
incorporated ; 

5 (d) removing the label of the nucleotide of (b) 

and the blocking group; and 

(e) optionally repeating steps (b) - (d) one or 
more times; 

thereby determining the sequence of a target 
10 single-stranded polynucleotide. 

37. The method of claim 36 wherein said 
incorpoirating step is accomplished by a Thermococcus 
sp. 

15 

38. The method of claim 37 wherein the Thermococcus 
sp is 9°N or a single mutant or double mutant thereof. 

39. The method of claim 38 wherein the double mutant 
20 is -Y409V A485L. 

40. The method of any one of claims 36 to 3 9 wherein 
the label of the nucleotide and the blocking group are 
removed in a single chemical treatment step. 

25 

41. A method according to any one of claims 3 6 to 40, 
wherein each of the nucleotides are brought into 
contact with the target sequentially, with removal of 
non- incorporated nucleotides prior to addition of the 

30 next nucleotide, and wherein detection and removal of 
the label and the blocking group is carried out either 
after addition of each nucleotide, or after addition 
of all four nucleotides. 
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42. The method according to any one of claims 36 to 
40 , wherein each of the nucleotides are brought into 
contact with the target together simultaneously, and 

5 non- incorporated nucleotides are removed prior to 

detection and subsequent to removal of the label and 
the blocking group. 

43. The method according to any one of claims 36 to 

10 40, comprising a first step and a second step, wherein 
in the first step, a first composition comprising two 
of the four nucleotides is brought into contact with 
the target and non- incorporated nucleotides are 
removed prior to detection and subsequent to removal 

15 of the label, and wherein in the second step, a second 
composition comprising the two nucleotides not 
included in the first composition is brought into 
contact with the target, and non- incorporated 
nucleotides are removed prior to detection and 

20 subsequent to removal of the label and blocking group, 
and wherein the first and second steps are optionally 
repeated one or more times. 

44. The method according to any one of claims 36 to 
25 40, comprising a first step and a second step, wherein 

in the first step, a composition comprising one of the 
four nucleotides is brought into contact with the 
target, and non- incorporated nucleotides are removed 
prior to detection and subsequent to removal of the 
30 label and blocking group and wherein in the second 
step, a second composition comprising the three 
nucleotides not included in the first composition is 
brought into contact with the target, and non- 
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incorporated nucleotides are removed prior to 
detection and subsequent to removal of the label and 
blocking group and wherein the first steps and the 
second step are optionally repeated one or more times. 

5 

45. . The method according to any one of claims 36 to 
40, comprising a first step and a second step, wherein 
in the first step, a first composition comprising 
three of the four nucleotides is brought into contact 

10 with the target, and non- incorporated nucleotides are 
removed prior to detection and subsequent to removal 
of the label and blocking group and wherein in the 
second step, a composition comprising the nucleotide 
not included in the first composition is brought into 

15 contact with the target, and non- incorporated 
nucleotides are removed prior to detection and 
subsequent to removal of the label and blocking group 
and wherein the first steps and the second step are 
optionally repeated one or more times. 

20 

46. A kit, comprising: 

(a) a a plurality of different nucleotides 
wherein said plurality of different nucleotides are 
either as defined in any one of claims 6 to 10 or as 

25 defined in any one of claims 12 to 15; and 

(b) packaging materials therefor. 

47. A kit according to claim 46, wherein the 
detectable label in each nucleotide can be 

30 distinguished upon detection from the detectable label 
used for any of the other three types of nucleotide. 

48. The kit of claim 46 or 47, further comprising an 
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enzyme and buffers appropriate for the action of the 
enzyme . 

49. Use of a nucleotide as defined in any one of 

5 claims 1 to 15 in a Sanger or a Sanger-type sequencing 
method. 

50. A method of using a nucleotide of claim 1 wherein 
said method includes a Sanger or Sanger-type 

10 sequencing method. 
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where R, and R 2 . which may be the 
same or dWferent are each selected 
irom H, OH, or any group which can be 
transformed into an OH. Suitable groups 
for R, and R 2 are described in Figure 3 

X» H, phosphate, diphosphate or triphosphate 
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Label Cleavable linker Base, 



fc — I B 

Cleavable linkers may include: 
L 

■JS— s 
B 



where R! and R 2 , which may be the same or 
different, are each selected from H, OH, or 
any group which can be transformed into an 
OH, including a carbonyl 



B^N 




X 



B. N 



L B ^ N 



R3 represents one or more 
substituents independently 
selected from alkyl, alkoxy, 
amino or halogen 



Alternatively, cleavable linkers may 
be constructed from any labile 
functionality used on the 3'-block 



R^J? R^O 
R« R« 



Rf^F 



Ri and R 2 groups may include 
O 



II o*S 



>5 
R4 N 3 



Xh~^> 



-o 

D^O' R 6 0 ^N' R 6 o^S-** 

r^s: R 4 ^t 



0 ^R $ Jj^Rj 



where R 4 is H or alkyl, R5 is H or 
alkyl and R 6 is alkyl, cycloalkyl, 
alkenyl, cycloalkenyl or benzyl 

and X is H. phosphate, diphosphate or triphosphate 



Fig. 3 
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Fig. 6 
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